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RECOMBINANT HK2 POLYPEPTIDE 



5 Background of the Invention 

The glandular kallikreins are a subgroup of serine proteases 
which are involved in the post-translational processing of specific polypeptide 
precursors to their biologically active forms. The human kallikrein gene 
family consists of three members: prostate-specific antigen, human glandular 

10 kallikrein, and pancreatic/renal kallikrein. See J.A Clements, Endocr. Rev. . 
10, 393 (1989) and T.M Chu et al. (U.S. Patent No. 4,446,122). A common 
nomenclature for these members of the tissue (glandular) kallikrein gene 
families was recently adopted by T. Berg et al, in Recent Progres s on Kinins: 
Biochemistry and Molecula r Biology of the Kallikrein-Kinin System. Agate 

15 and Actions Supplements. Vol. I.. H. Fritz et al.., eds., Birkkauser Verlag, 
Basel (1992), and is defined in Table I, below. 
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The DNA sequence homology between hKLK2 and hKLK3 
(exon regions) is 80%, whereas the homology between hKLK2 and hKLKl is 
65%. The deduced amino acid sequence homology of hK2 to hKl is 57% 
Amino acid sequences deduced by LJ. Schedlich et al., DNA. 6, 429 (1987) 
5 and B J. Morris, Clin. Exp. Pharmacol. Phvsiol. Ifi, 345 (1989) indicate that 
hK2 may be a tryps in-like serine protease, whereas hK3 (PSA) is a 
chymotrypsin-like serine protease. Therefore, if hK2 is indeed secretory, it 
may have a different physiological function than hK3. 

The hKLK2 gene is located about 12 kbp downstream from the 

10 hKLK3 gene in a head-to-tail fashion on chromosome 19. (P.H. Riegman et 
al., FEBS Lett. 247. 123, (1989)). The similarities of gene structure and 
deduced amino acid sequences of these human kallikreins suggest that their 
evolution may involve the same ancestral gene. Most interestingly, as 
reported by Morris, cited apa; P. Chapdelaine, FEBS Lett.. 236. 205 (1988); 

15 and Young, Biochemistry, 31 r 1952 (1992), both hK2 and hK3 may be 
expressed only in the human prostate, while expression of hKl is limited to 
the pancreas, submandibular gland, kidney, and other nonprostate tissues. 

Tremendous interest has been generated in hK3 (PSA) because 
of the important role it plays as a marker to detect and to monitor progression 

20 of prostate carcinoma. Its usefulness as a marker is based on the elevated 
serum concentration of circulating hK3 proteins which are frequently 
associated with prostatic cancer. The serum concentration of hK3 has been 
found to be proportional to the cancer mass in untreated patients, but is also 
proportional to the volume of hyperplastic tissue in patients with benign 

25 prostatic hyperplasia (BPH). Hie serum levels of hK3 become reduced 
following prostate cancer therapy. 

Despite the iriformation which can be ascertained about hK2 
from the genomic DNA sequence, very little is known about the hK2 
polypeptide itself. The reason for this is that the protein has not been purified 

30 and characterized. Thus, a need exists for a method to obtain hK2 

polypeptide and related polypeptides in sufficient quantity and purity for 
characterization and for use as merapeutic/diagnostic agents or reagents. 
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Summary of the Invention 
The present invention provides an isolated, substantially 
homogenous hK2 polypeptide. As used herein, in the term "hK2 polypeptide" 
includes pre-pro hK2, pro hK2 and mature hK2 polypeptides. Pre-pro hK2 is 
5 secreted by the cell in vivo, and is cleaved during secretion to yield pro hK2, 
which is then enzymatically cleaved in the extracellular environment to yield 
''mature'' hK2. Most preferably, the hK2 polypeptide is contiguous in amino 
acid sequence with SEQ ID NO: 16, SEQIDNO: 6 or SEQ ID NO: 10. 

The present invention also provides isolated nucleic acid 
10 molecules encoding hK2 polypeptide, including (a) a cDNA molecule 

comprising the nucleotide sequence of the coding region of the hK2 gene; (b) 
a DNA molecule capable of hybridizing under stringent conditions to a 
nucleotide sequence complementary to the nucleotide sequence of (a); and (c) 
a genetic variant of any of the DNA molecules of (a) and (b) which encodes 
1 5 of polypeptide processing an antigenic function of naturally cccurring hK2 
polypeptide. Preferably, the nucleic acid comprises a discrete, isolated DNA 
or RNA molecule encoding the complete hK2 polypeptide, which can include 
the pre-pro, pro or rnature forms. Most preferably, the nucleic acid is a DNA 
sequence contiguous with SEQ ID NO: 5, 7 or 9, i.e., as shown in Figs. 5, 6 
20 or 7. These DNA sequences can be produced using the polymerase chain 
reaction (PCR), and novel oligonucleotide primers employed in the synthesis 
are also an embodiment of the invention. 

The nucleic acid sequence also can comprise a promoter 
operably linked to the nucleic acid sequence. Therefore, the invention also 
25 comprises a chimeric expression vector comprising the above-described 

nucleic acid sequence, operationally linked to control sequences recognized by 
a host cell transformed with the vector, as well as said transformed host cell, 
and methods of its preparation and use to produce recombinant hK2. Thus, 
the present invention also provides a method of using a nucleic acid molecule, 
30 such as a cDNA clone encoding hK2 polypeptide, comprising expressing the 
nucleic acid molecule in a cultured host cell transformed, preferably stably 
transformed, with a chimeric expression vector comprising said nucleic acid 
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molecule operably linked to control sequences recognized by the host cell 
transformed with the vector; and recovering the hK2 polypeptide from the 
transgenic host cell, i.e., from the culture medium. As used herein, the term 
"chimeric" means that the vector comprises DNA from at least two different 
5 species, or comprises DNA from the same species, which is linked or 
associated in a manner which does not occur in the "native" form of said 
species. 

More specifically, E. coli and baculovirus insect cells systems 
have been employed to produce hK2 polypeptides in two forms, i.e. pre-pro 

10 hK2 (pphK2) and mature hK2 (mhK2). Thus, the present invention provides 
the first example of the overexpression of hK2 in heterologous systems. 
However, although pphK2 produced in E. coli has proven to be an invaluable 
resource for generating antibodies to the denatured form of the protein, it is 
desirable to both discern the steps involved in the biosynthesis of hK2 and to 

15 obtain antibodies specific for the fully processed and secreted form of the 
protein. Therefore, mammalian cell systems have been employed to produce 
hK2 polypeptides. Thus, the present invention also provides the first example 
of the expression of hK2 in mammalian cells and purification and 
characterization of the secreted protein. 

20 The high degree of amino acid sequence homology of hK2 with 

hK3 indicates that measuring serum concentrations of both proteins may be 
useful in the diagnosis and monitoring of prostate cancer. For example, the 
antibodies developed against hK3 now used in these assays could theoretically 
also recognize hK2, because of mutual contamination in the antigenic 

25 preparations used to develop the anti-hK3 antibodies or because of antibody 
cross-reactivity between these two proteins. This could account for the 
substantial percentage of false positive results which are observed in current 
hK3 assays. On the other hand, if circulating hK2 levels are also elevated 
above baseline levels in prostate cancer patients, detection of hK2 by hK2- 

30 specific antibodies could provide an alternative, confirmatory assay for 
prostate cancer. 
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Therefore, hK2 polypeptide, as well as variants and subunits 
thereof; produced by the present method can be used to produce populations 
of antibodies that, in turn, can be used as the basis for assays to detect and 
quantify hK2 polypeptide (or "protein") in samples derived from tissues such 

5 as prostate carcinomas, cells such as prostate cell lines, or from fluids such as 
seminal fluid or blood. Thus, the present invention also provides populations 
of monoclonal or polyclonal antibodies that specifically bind to hK2 
polypeptide, while not significantly binding to hK3. The term "significantly" 
is defined by reference to the comparative assays discussed below. These 

10 antibodies can also be used in affinity chromatography, to purify mammalian 
hK2 from natural sources. The isolated, substantially homogeneous hK2 can 
also be employed as a component in diagnostic assays for "native" hK2 in 
samples derived from human tissues or physiological fluids. For example, the 
recombinant hK2 can be bound to a detectable label and employed in 

15 competitive immunoassays for hK2, as described in U.S. patent application 
Serial No. 08/096,946, filed July 22, 1993. 

As used herein with respect to the present invention, the terms 
"hK2 polypeptide," "hK2 protein," and "hK2" are considered to refer to 
identical human materials, unless otherwise indicated 

20 

Brief Description of the Figures 
Figure 1 depicts a time course study of recombinant pphK2 in 
sfi cells infected with recombinant pphK2 virus. At each of the time points 
cells were depleted of methionine and cysteine for 1 hour in deficient media 
25 and then supplemented with [ 35 S]-methione and [ 35 S]-cysteine. Protein was 
determined by Bradford assay. Aliquots of protein (20 jig) were loaded onto 
a 12% Tris-Glycine SDS gel. A Phosphorimager cassette was exposed 
overnight The band of interest is indicated with an arrow, w.t: wild type. 

Figure 2 depicts the detection of recombinant mhK2 in cell 
30 lysate fractions. Sfi cells were infected either with recombinant mhK2, wild 
type or left iminfected for 48 hours. Methionine and cysteine pools were 
depleted for 1 hour in deficient media. Cells were supplemented with [ 35 S]- 
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methionine and [ 3S S]-cysteine for 6 hours. Cells were separated into soluble 
and insoluble fractions using H 2 0 and repeated freeze/thaw conditions. 
Aliquots of protein (50 ug per lane) were loaded onto a 10% Tris-Glycine 
SDS gel and electrophoresed. The gel was dried and exposed to x-ray film 
5 for 2 days. The band of interest is indicated with an arrow. 

Figure 3 depicts the expression of recombinant pphK2 in £. 
coli. E. coli strain BL21 (DE3) LysS harboring pBppHK2 was grown in LB 
media to O-D.^ 0.2 and incubated without (lane 2, not-induced (N)) or with 
(lane 3, induced (I)) 0.4 mM IPTG for 2 hrs. Cells were lysed in sample 

10 buffer and subjected to SDS/PAGE on a 4-20% gradient gel. Protein bands 
were visualized by staining the gel with Coomassie blue. 

Figure 4 depicts the amino acid sequences of mature hK2 
(deduced from cDNA sequence, SEQ ID NO: 16) and hK3 (SEQ ID NO: 1). 
Underlined sequences denote nonhomologous regions that can be used for 

15 preparation of antibodies specific to hK2. 

Figure 5 depicts pphK2 cDNA containing a BamHl site at the 
5' end and a Pstl site at the 3' end (SEQ ID NO: 5) (coding strand is 
numbered) as well as the amino acid sequence of pre-pro hK2 encoded 
thereby (SEQ ID NO: 6). The amino acid sequences of pro hK2 and mature 

20 hK2 are also shown on the Figure. 

Figure 6 depicts mhK2 cDNA containing an EcoRl site at the 
5' end and Pstl site at the 3' end (SEQ ID NO: 7), as well as the 
ccnresponding amino acid sequence (SEQ ID NO: 8) which encompasses the 
amino acid sequence of mhK2 polypeptide. 

25 Figure 7 depicts pro hK2 DNA (SEQ ID NO: 9) (coding strand 

is numbered) and the amino acid sequence of pro hK2 (SEQ ID NO: 10). 

Figure 8 depicts a gel confirming the expression of recombinant 
pphK2 in a mammalian cell line. AV12-pGThK2 (Lane 4-6) and AV12-pGT- 
d (Lane 3) clonal cell lines were grown in D10F media About 300ul of 

30 spent medium from the above clones were concentrated and subjected to 
SDS/PAGE along with See Blue MW marker (lane 1) and pphK2 lysate from 
E. coli cells (lane 2). The gel was blotted onto nitrocellulose paper and 
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immunoblotted using a 1/1000 dilution of anti-pphK2 rabbit antiserum. HRP- 
goat anti-rabbit was used as the secondary probe and the blot was developed 
by DAB plus R 2 ^ Lane 3 (AV12-pGT-d) is AV12 transfected with vector 
without insert 

5 Figure 9 depicts the DEAE chromatography of AV12 media 

The sample was applied in a bicarbonte buffer, pH 8 and eluted with a salt 
gradient The solid line is the A^ elution profile. The triangle line 
represents the FT ISA assay of individual samples which had been dried onto 
microtiter plates and developed with rabbit anti-hK2 antibody. 

10 Figure 10 depicts the hydrophobic interaction profile of DEAE 

fractions. The fractions were pooled, concentrated and applied to an HIC 
column in 1.2 M sodium sulfate, and eluted with a decreasing salt gradient. 
The solid line is A^ and the triangle line shows the ELISA assay profile of 
the fractions using rabbit anti-hK2 antibody. 

15 Figure 1 1 depicts the Size Exclusion Chromatography of HIC 

purified prohK2, in particular, the A 280 profile of 22 min peak eluted off HIC 
column. The 19.4 min peak appears homogeneous by SDS-PAGE. After this 
peak was lyophilized, the N-terminal sequence and amino acid composition 
confirmed its identity as the pro form of hK2. 

20 Figure 12 depicts the SDS/PAGE analysis of prohK2 and PSA. 

1.5ug of purified phK2 or PSA was boiled in sample buffer containing (R) or 
not containing (N) 1% BME. Samples were subjected to SDS/PAGE on a 4- 
20% gel. The protein bands were visualized by staining the gel with silver. 

25 Detailed Desc ription of the Invention 

As used herein, the term "hK2 polypeptide" preferably 
encompasses the recombinant pre-pro, pro and mature hK2 polypeptides. As 
proposed herein, a mature hK2 polypeptide having the amino acid sequence 
shown in Fig. 4 (SEQ ID NO: 16), as well as "variant" polypeptides which 

30 share at least 90% homology with SEQ ID NO: 16 in the regions which are 
substantially homologous with hK3, i.e., which regions are not identified by 
bars as shown in Fig. 4. Such hK2 polypeptides also possess antigenic 
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function in common with the mature hK2 molecule of Fig. 4, in that said 
polypeptides are also definable by antibodies which bind specifically thereto, 
but which do not cross-react with hK3 (or hKl). Preferably, said antibodies 
react with antigenic sites or epitopes that are also present on the mature hK2 
5 molecule of Fig. 4. Antibodies useful to define common antigenic function 
are described in detail in Ser. No. 08/096,946, i.e., polyclonal antisera 
prepared in vivo against hK2 submit 41-56. 

"Isolated hK2 nucleic acid" is RNA or DNA containing greater 
than 15, preferably 20 or more, sequential nucleotide bases that encode a 

10 biologically active hK2 polypeptide or a variant fragment thereof; that is 
complementary to the non-coding strand of the native hK2 polypeptide RNA 
or DNA, or hybridizes to said RNA or DNA and remains stably bound under 
stringent conditions. Thus, the RNA or DNA is isolated in that it is tree from 
at least one contaminating nucleic acid with which it is normally associated in 

15 the natural source and is preferably substantially free of any other marnrnalian 
RNA or DNA. Hie phrase "free from at least one contaminating source 
nucleic acid with which it is normally associated" includes the case where the 
nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 

20 normally found in the source cell. An example of isolated hK2 nucleic acid is 
RNA or DNA that encodes a biologically active hK2 polypeptide sharing at 
least 90% sequence identity with the hK3-homologous regions of the hK2 
peptide of Fig. 4, as described above. Hie term "isolated, substantially 
homogenous" as used with respect to an hK2 polypeptide is defined in terms 

25 of the methodologies discussed herein below. 

As used herein, the term "recombinant nucleic acid," i.e., 
"recombinant DNA" refers to a nucleic acid, i.e., to DNA that has been 
derived or isolated from any appropriate tissue source, that may be 
subsequently chemically altered in vitro, an later introduced into target host 

30 cells, such as cells derived from animal, plant, insect, yeast, fungal or 

bacterial sources. An example of recombinant DNA "derived" from a source, 
would be a DNA sequence that is identified as a useful fragment encoding 
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hK2, or a fragment or variant thereof, and which is then chemically 
synthesized in essentially pure form. An example of such DNA "isolated" 
from a source would be a useful DNA sequence that is excised or removed 
from said source by chemical means, e.g, by the use of restriction 
5 endonucleases, so that it can be further manipulated, e.g., amplified, for use 
the invention, by the methodology of genetic engineering. 

Therefore, "recombinant DNA" includes completely synthetic 
DNA sequences, semi-synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
10 sequences derived from introduced RNA, as well as mixtures thereof. 
Generally, the recombinant DNA sequence is not originally resident in the 
genome of the host target cell which is the recipient of the DNA, or it is 
resident in the genome but is not expressed 

The recombinant DNA sequence, used for transformation 
1 5 herein, may be circular or linear, double-stranded or single-stranded. 
Generally, the DNA sequence is in the form of chimeric DNA, such as 
plasmid DNA, that can also contain coding regions flanked by control 
sequences which promote the expression of the recombinant DNA present in 
the resultant cell line. For example, the recombinant DNA may itself 
20 comprise a promoter that is active in mammalian cells, or may utilize a 
promoter already present in the genome that is the transformation target 
Such promoters include the CMV promoter, as well as the SV 40 late 
promoter and retroviral LTOs (long terminal repeat elements). Aside mom 
recombinant DNA sequences that serve as transcription units for hK2 or 
25 portions thereof, a portion of the recombinant DNA may be untranscribed, 
serving a regulatory or a structural function. 

"Control sequences" is defined to mean DNA sequences 
necessary for the expression of an operably linked coding sequence in a 
particular host organism The control sequences that are suitable for 
30 prokaryotic cells, for example, include a promoter, and optionally an operator 
sequence, and a ribosome binding site. Eukaryotic cells are known to utilize 
promoters, polyadenylation signals, and enhancers. 
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"Operably linked" is defined to mean that the nucleic acids are 
placed in a functional relationship with another nucleic acid sequence. For 
example, DNA for a presequence or secretory leader is operably linked to 
DNA for a polypeptide if it is expressed as a preprotein that participates in 
5 the secretion of the polypeptide; a promoter or enhancer is operably linked to 
a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translatioa Generally, "operably linked" means that the DNA 
sequences being linked are contiguous and, in the case of a secretory leader, 
10 contiguous and in reading phase. However, enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction 
sites. If such sites do not exist, the synthetic oligonucleotide adaptors or 
linkers are used in accord with conventional practice. 

Aside from recombinant DNA sequences that serve as 
15 transcription units for hK2 or portions thereof; a portion of the recombinant 
DNA may be untranscribed, serving a regulatory or a structural functioa 

The recombinant DNA to be introduced into the cells further 
will generally contain either a selectable marker gene or a reporter gene or 
both to facilitate identification and selection of transformed cells from the 
20 population of cells sought to be transformed. Alternatively, the selectable 
marker may be carried on a separate piece of DNA and used in a co- 
trarisforrnation procedure. Both selectable markers and reporter genes may be 
flanked with appropriate regulatory sequences to enable expression in the host 
cells. Useful selectable markers are well known in the art and include, for 
25 example, antibiotic and herbicide-resistance genes, such as nSQ, hpt dhiL hat 
am A. gjpA and the like. 

Reporter genes are used for identifying potentially transformed 
cells and for evaluating the functionality of regulatory sequences. Reporter 
genes which encode for easily assayable proteins are well known in the art 
30 In general, a reporter gene is a gene which is not present in or expressed by 
the recipient organism or tissue and which encodes a protein whose expression 
is manifested by some easily detectable property, e.g., enrymatic activity. 
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Preferred genes include the chloramphenicol acetyl transferase gene (cat) from 
Tn9 of E. coli. the beta-glucuronidase gene (gus) of the uidA locus of E_£Qli, 
and the luciferase gene from firefly Photinus pyralis. Expression of the 
reporter gene is assayed at a suitable time after the DNA has been introduced 

5 into the recipient cells- 
Other elements functional in the host cells, such as introns, 
enhancers, polyadenylation sequences and the like, may also be a part of the 
recombinant DNA- Such elements may or may not be necessary for the 
function of the DNA, but may provide improved expression of the DNA by 

10 affecting transcription, stability of the mRNA, or the like. Such elements may 
be included in the DNA as desired to obtain the optimal performance of the 
traraforniing DNA in the cell. 

The general methods for constructing recombinant DNA which 
can transform target cells are well known to those skilled in the art, and the 

15 same compositions and methods of construction may be utilized to produce 
the DNA useful herein. For example, J. Sambrook et al., Molecular Cloning : 
A T.ahoratorv Manual. Cold Spring Harbor Laboratory Press (2d ed, 1989), 
provides suitable methods of construction. 

The recombinant DNA can be readily introduced into the target 

20 cells by transfection with an expression vector comprising cDNA encoding 
hK2, for example, by the modified calcium phosphate precipitation procedure 
of C. Chen et al., Mol. Cell. Biol.. Z 2745 (1987). Transfection can also be 
accomplished by lipofectin, using commercially available kits, e.g., provided 
by BRL. 

25 Suitable host cells for the expression of hK2 polypeptide are 

derived from multicellular organisms. Such host cells are capable of complex 
processing and glycosylation activities. In principle, any higher eukaryotic 
cell culture is workable, whether from vertebrate or invertebrate culture. 
Examples of invertebrate cells include plant and insect cells. Numerous 

30 baculoviral strains and variants and corresponding permissive insect host cells 
from hosts such as Spodoptera frugiperda (caterpillar), Aedes aegypti 
(mosquito), Aedes dbopictus (mosquito), DrosojMa melanogqster (fruitfly), 
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and Bombyx mori have been identified See, e.g., Luckow et al., 
Bio/Technology , & 47 (1988); Miller et al., in Genetic Engineering, J. K. 
Setlow et al., eds., Vol 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda 
et al., Nature. 315:592 (1985). A variety of viral strains for transfection are 

5 publicly available, e.g., the L-l variant of Autogrvpha cdifomica NPV and the 
Bm-5 strain of Bombyx mori NPV, and such viruses may be used, preferably 
for transfection of Spodopteru frugiperda cells. 

Recovery or isolation of a given fragment of DNA from a 
restriction digest can empoly separation of the digest on polyacrylamide or 

10 agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known 
molecular weight, removal of the gel section containing the desired fragment, 
and separation of the gel from DNA For example, see Lawn et al., Nucleic 
Acids Res. . 2, 6103-6114 (1981), and Goeddel et al„ Nucleic Acids Res.. & 

15 4057(1980). 

"Southern analysis" or "Southern blotting" is a method by 
which the presence of DNA sequences in a restriction endonuclease digest of 
DNA or DNA-containing composition is confirmed by hybridization to a 
known, labeled oligonucleotide or DNA fragment Southern analysis typically 

20 involves electrophoretic separation of DNA digests on agarose gels, 

denaturation of the DNA after electrophoretic separation, and transfer of the 
DNA to nitrocellulose, nylon, or another suitable membrane support for 
analysis with a radiolabeled, biotinylated, or enzyme-labeled probe as 
described in sections 9.37-9.52 of Sambrook et al., supra. 

25 "Northern analysis" or "Northern blotting" is a method used to 

identify RNA sequences that hybridize to a known probe such as an 
oligonucleotide, DNA fragment, cDNA or fragment thereof, or RNA fragment 
The probe is labeled with a radioisotope such as 32-P, by biotinylation or with 
an enzyme. The RNA to be analyzed can be usually electrophoretically 

30 separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, 
nylon, or other suitable membrane, and hybridized with the probe, using 
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standard techniques well known in the art such as those described in sections 
7.39-7.52 of Sambrook et at, supra. 

"Polymerase chain reaction" or "PCR" refers to a procedure or 
technique in which amounts of a preselected piece of nucleic acid, RNA 
5 and/or DNA, are amplified as described in U.S. Patent No. 4,683,195. 
Generally, sequence information from the ends of the region of interest or 
beyond is employed to design oligonucleotide primers. These primers will be 
identical or similar in sequence to opposite strands of the template to be 
amplified. PCR can be used to amplify specific RNA sequences, specific 
10 DNA sequences from total genomic DNA, and cDNA transcribed from total 
cellular RNA, bacteriophage or plasmid sequences, and the like. See 
generally Mullis et al., Cold Spring Harbor Symp Quant. Biol,, 51, 263 
(1987); Eriich, ed, PCR Technology, (Stockton Press, NY, 1989). 

"Stringent conditions" are those that (1) employ low ionic 
15 strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 
M sodium citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50°C, or (2) 
employ during hybridization a denaturing agent such as formamide, for 
example, 50% (vol/vol) formamide with 0.1% bovine serum albumin/0.1% 
Ficoll/0.1% r»lyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 
20 with 750 mMNaCl, 75 mM sodium citrate at 42°C Another example is use 
of 50% forrnarnide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM 
sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardfs 
solution, sonicated salmon sperm DNA (50 ^g/ml), 0.1% SDS, and 10% 
dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC and 0.1% SDS. 
25 When hK2 polypeptide is expressed in a recombinant cell other 

than one of human origin, the hK2 polypeptide is completely free of proteins 
or polypeptides of human origin. However, h is necessary to purify hK2 
polypeptide from recombinant cell proteins or polypeptides to obtain 
preparations that are substantially homogeneous as to hK2 polypeptide. For 
30 example, the culture medium or lysate can be centrifuged to remove 

particulate cell debris. The membrane and soluble protein fractions are then 
separated The hK2 polypeptide may then be purified from the soluble 
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protein fraction and, if necessary, from the membrane fraction of the culture 
lysate. HK2 polypeptide can then be purified from contaminant soluble 
proteins and polypeptides by fractionation on immunoafBnity or ion-exchange 
columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 

5 or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE; 
ammonium sulfate precipitation; gel filtration using, for example, Sephadex 
G-75; or ligand affinity chromatography. 

Once isolated from the resulting transgenic host cells, 
derivatives and variants of the hK2 polypeptide can te^ For 

10 example, amides of the hK2 polypeptides of the present invention may also be 
prepared by techniques well known in the art for converting a carboxylic acid 
group or precursor, to an amide. A preferred method for amide formation at 
the C-terminal carboxyl group is to cleave the polypeptide from a solid 
support with an appropriate amine, or to cleave in the presence of an alcohol, 

1 5 yielding an ester, followed by aminolysis with the desired amine. 

Salts of carboxyl groups of the hK2 polypeptide may be 
prepared in the usual manner by contacting the peptide with one or more 
equivalents of a desired base such as, for example, a metallic hydroxide base, 
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for 

20 example, sodium carbonate or sodium bicarbonate; or an amine base such as, 
for example, triemylamine, triethanolamine, and the like. 

N-acyl derivatives of an amino group of the present 
polypeptides may be prepared by utilizing an N-acyl protected amino acid for 
the final condensation, or by acylating a protected or unprotected peptide. O 

25 acyl derivatives may be prepared, for example, by acylation of a free hydroxy 
peptide or peptide resin. Either acylation may be carried out using standard 
acylating reagents such as acyl halides, anhydrides, acyl imidazoles, and the 
like. Both N- and O-acylation may be carried out together, if desired In 
addition, the internal hK2 amino acid sequence of Fig. 4 can be modified by 

30 substituting one or two conservative amino acid substitutions for the positions 
specified, including substitutions which utilize the D rather than L form. The 
invention is also directed to variant or modified forms of the hK2 polypeptide 
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of Fig. 4. One or more of the residues of this polypeptide can be altered, so 
long as antigenic function is retained Conservative amino acid substitutions 
are preferred-that is, for example, aspartic-glutamic as acidic amino acids; 
lysine/arginine/histidine as basic amino acids; leucine/isoleucine, 

5 methionine/valine as hydrophobic amino acids; 

serme/glycme/alarJne/threonine as hydrophilic amino acids. 

Acid addition salts of the polypeptides may be prepared by 
contacting the polypeptide with one or more equivalents of the desired 
inorganic or organic acid, such as, for example, hydrochloric acid. Esters of 

10 carboxyl groups of the polypeptides may also be prepared by any of the usual 
methods known in the art. 

Once isolated, hK2 polypeptide and its antigenicity active 
variants, derivatives and fragments thereof can be used in assays for hK2 in 
samples derived from biological materials suspected of containing hK2 or 

15 anti-hK2 antibodies, as disclosed in detail in Serial No. 08/096,946. For 
example, the hK2 polypeptide can be labelled with a detectable label, such as 
via one or more radiolabelled peptidyl residues, and can be used to compete 
with endogenous hK2 for binding to anti-hK2 antibodies, i.e., as a "capture 
antigen" to bind to anti-hK2 antibodies in a sample of a physiological fluid, 

20 via various competitive immunoassay format for hK2 which uses immobilized 
anti-hK2 antibodies is carried out by: 

(a) providing an amount of anti-hK2 antibodies attached to a solid 
surface; 

(b) mixing the sample of physiological fluid to be tested with a 
25 known amount of hK2 polypeptide which comprises a 

detectable label, to produce a mixed sample; 

(c) contacting said antibodies on said solid surface with said mixed 
sample for a sufficient time to allow immunological reactions 
to occur between said antibodies and said hK2, and between 

30 said antibodies and said labelled polypeptide; 

(d) separating the solid surface from the mixed sample; 
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(e) detecting or determining the presence or amount of labelled 
polypeptide either bound to the antibodies on the solid surface 
or remaining in the mixed sample; and 

(f) determining from the result in step (e) the presence or amount 
5 of said hK2 in said sample. 

In another format which can detect endogenous hK2 in a 
sample by a competitive inhibition immunoassay, a known amount of anti- 
hK2 antibody is added to a sample containing an unknown amount of 
endogenous hK2. The known amount is selected to be less than the amount 

10 required to complex all of the hK2 suspected to be present, e.g, that would be 
present in a sample of the same amount of physiological fluid obtained from a 
patient known to be prostate cancer. Next, a known amount of the hK2 
polypeptide of the invention or a subunh thereof comprising a detectable 
label is added If endogenous hK2 is present in the sample, fewer antibodies 

15 will be available to bind the labelled hK2 polypeptide, and it will remain free 
in solution. If no endogenous hK2 is present, the added labelled polypeptide 
will complex with the added anti-hK2 antibodies to form binary complexes. 
Next, the binary antibody-antigen complexes are precipitated by an anti- 
mammal IgG antibody (sheep, goat, mouse, etc.). The amount of radioactivity 

20 or other label in the precipitate (a ternary complex) is inversely proportional 
to the amount of endogenous hK2 that is present in the sample, e.g, a pellet 
containing reduced amounts of radioactivity is indicative of the presence of 
endogenous hK2. 

Alternatively to the conventional techniques for preparing 

25 polyclonal antibodies or antisera in laboratory and farm animals, monoclonal 
antibodies against hK2 polypeptide can be prepared using known hybridoma 
cell culture techniques. In general, this method involves prepared an 
antibcxiy-producing fused cell line, e.g., of primary spleen cells fused with a 
compatible continuous line of myeloma cells, and growing the fused cells 

30 either in mass culture or in an animal species from which the myeloma cell 
line used was derived or is compatible. Such antibodies offer many 
advantages in comparison to those produced by inoculation of animals, as they 
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are highly specific and sensitive and relatively "pure" immunochemically. 
Immunologically active fragments of the present antibodies are also within the 
scope of the present invention, e.g., the f(ab) fragment, as are partially 
humanized monoclonal antibodies. 
5 The invention will be further described by reference to the 

following detailed examples. 

Example 1. 

fhnrinicikin of hK2 expression vectors 
10 (A) Generation of recombinant baculoviruses containing tmhK2 and mhK2 

coding sequences 

A cDNA (approximately 820 bp long) encoding the entire 
prepro-hK2 (pphK2) (from nucleotide #40 to #858 relative to the start site of 
the pphK2 transcript), as shown in Fig. 5, was synthesized from RNA of 

15 human BPH tissue using reverse-transcription polymerase chain reaction (RT- 
PCR) technology with a pair of hK2 specific oligonucleotide primers 
(5 , ACGCGGATCCAGCATGTGGGACCTGGTrCTCT3' SEQ ID NO: 2 and 
5'ACAGCTGCAGTTTACTAGAGGTAGGGGTGGGAC 3' SEQ ID NO:3). 
This cDNA was generated such that 5' and 3' ends (with respect to pphK2 

20 sense sequence) were bracketed with BamHl and Pst 1 sequences 

respectively. The cDNA was then purified by agarose gel electrophoresis, and 
digested with BamHl and Pst 1 restriction enzymes. The restricted cDNA 
was ligated with the BamHl -Pst 1 digested pVL1393 plasmid vector and 
transformed into the E. coli HB101 straia E. coli harboring pphK2 

25 cDNA/pVL1393 plasmid vector were selected and verified by restriction 

enzyme mapping and DNA sequencing. Plasmid pphK2 cDNA/pVL1393 was 
mass-produced in E. coli and purified by CsCl gradient ultra-centrifugatioa 
cDNA encoding the mature hK2 was synthesized using PCR 
with the aforementioned pphK2 cDNA as the template plus a pair of hK2 

30 oligonucleotides (S'ACGCGGATCCAGCATGTGGGACCTGGTTCTCD' 
SEQ ID NO: 2 and 

5 , ACCGGAAT^CATGATTGTGGGAGGCTGGGAGTGT3 , SEQ ID NO: 4). 
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As noted, the 3* end oligonucleotide was the same 3' end oligonucleotide used 
for synthesizing the pphK2 cDNA. However, the 5' end oligonucleotide was 
different from the 5' oligonucleotide used for the pphK2 cDNA, and therefore 
generates a cDNA coding for the mature form of hK2 (mhK2), as shown in 
5 Fig. 6. The mhK2 cDNA was bracketed with EcoRl and Pstl sequences at 
the 5' and 3' ends respectively. The protein produced from the mhK2 cDNA 
will gain an exogenous methionine at its N-terminus. The mhK2/pVL1393 
vector was generated and purified as described for pphK2/pVL1393. The 
DNA sequence analysis for pphK2 and mhK2 in pVL1393 showed that one 

10 nucleotide (#805) has been altered (G to T) in a silent mutation. 

pphK2/pVL1393 or mhK2/pVL1393 DNA (2 ug) were 
cotransfected with a linearized Baculogold DNA (0.5 ug; Pharmingen, San 
Diego, CA) into ^9 insect cells according to Pharmingen instructions (S. 
Gruenwold et al., baculovirus expression vector system: Procedures and 

15 Methods Manual, Pharmingen, San Diego, CA (1993)). Four to six days after 
the transfection, ^9 cell spent medium containing viral particles was harvested 
and used to infect fresh ^9 cells to amplify viral titers. Total RNA was 
isolated for Northern blot analysis of authentic pphK2 or mhK2 transcript 
using hK2 cDNA as a probe. Further proof of pphK2 or mhK2 transcript 

20 expressed in recombinant virus infected ^9 cells was obtained by RT-PCR 
and DNA sequencing. Pure recombinant baculovirus containing pphK2 or 
mhK2 were obtained by secondary or tertiary plaque purification protocol 
according to instructions from Pharmingen (S. Gruenwold et al., cited above). 

25 Example 2. 

Generation of prokarvotic expression vector 

A 0.8 Id) fragment representing the entire preprohK2 (pphK2) 
coding sequence was generated by polymerase chain reaction (PCR) using 
primers A (5TATACATATGTGGGACCTGGTTCTCTCC3' SEQ ID NO.: 1 1) 

30 and B (S'ATATXjGATXXJrcAGGGGTTGGCTGCGATGGTJ' SEQ ID NO: 
12) and plasmid pVL1393 containing pphK2 as the template. The pphK2 
bacterial expression vector (pBPPHK2) was prepared by standard DNA 
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cloning technology, (Sambrook, cited above), to subclone this 0.8 kb fragment 
into the Ndel/BamHl site of the plasmid pPHS579 (a gift from Dr. H. 
Hsiung, Eli Lilly Co, Inc.) under the control of T7 promoter. The DNA of 
the entire insert plus the cloning sites was sequenced to confirm that no 
5 cloning artifacts had occurred and to ensure that no anomalies in the sequence 
had been generated by PCR pBPPHK2 was transformed into E. coli BL21 
(DE3)Lys S (Novagen, Inc., Madison, WI). 

10 feneration of a mammalian expression vector 

To express hK2 in mammalian cell lines, a 0.8 kb fragment 
representing the entire preprohk2 (pphK2) coding sequence was generated by 
PCR using primers 

A(5'ATATGGATCCATATGTCAGCATGTGGGACCTGGTTCTCTCCA3') 

15 (SEQ ID NO: 17) and 

BCS'ATATGGATCCTCAGGGGTTGGCTGCGATGGTS') (SEQ ID NO: 12) 

and plasmid pVL1393 containing pphK2 as the template. The mammalian 
expression vector (pGThK2) was prepared using standard DNA cloning 
technology (Sambrook, 1989), to clone this 0.8 kb fragment into the Bel 1 site 

20 of the plasmid pGT-d (a gift from Dr. Brian Griimell, Eli Lilly, Inc.) under 
control of the GBMT promoter. The DNA of the entire insert plus the 
cloning sites was sequenced to confirm that no cloning artifacts had occurred 
and to ensure that no anomalies in the sequence had been generated by PCR. 
AV12-664 (ATCC CRL-9595), a cell line derived from a adenovirus-induced 

25 tumors in Syrian hamster, was grown in Dulbecco's modified Eagle's medium 
supplemented with 10% fetal bovine serum (D10F) and transfected with 
plasmid pGThK2 using the calcium phosphate method 

30 Identification of recombinant pphK2 and mhK2 

ft, fc y ^lnvirus - i nsflfl system 
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SJ9 cells ^xlOVplate) were seeded onto 100 ram Coming 
plates with 10% fetal calf serum - Graces medium at room temperature for 1 
hr. After attachment on culture plates, cells were infected with wild type or 
recombinant baculovirus in serum free Excell-400 medium and incubated at 
5 27°C. Control cells were grown in the absence of virus. At designated times 
(24-96 hr) cells were placed in fresh S/9-000 media deficient of either 
methionine or methionine and cysteine for 45-60 min at 27°C, then incubated 
with Promix (0.143 mCi/plate; a mixture of [ 35 S]-methionine and [ 35 S]- 
cysteine; 1,000-1,400 Ci/mmol; Amersham) in serum free and 

10 methionine/cysteine deficient S9IIOO medium (Biofluids) for 5-8 hr or 20 nr. 
After the end of each incubation time, cells and spent media were separated 
by centrifiigation (1,000 rpm; Beckman J-6B; Beckman, Fullerton, CA). Cells 
were washed and centrifiiged (13,000 rpm; Biofuge 13, Baxter) twice. The 
washed cells were lysed by freeze/thaw in a detergent buffer (10 mM Tris, pH 

15 7.5; 130 mM NaCl, 1% Triton X-100, 10 mM NaF; 10 mM NaPi, lOmM 
Nappi, pH7.5) or H 2 0 and centrifiiged to obtain cytosol and insoluble cellular 
fractions. Protein contents of the above samples were determined by either 
the Bradford or Lowry method (BioRad, Inc., Melville, N.Y.). The above 
spent media, cystosol and insoluble cellular fraction were frozen and stored 

20 separately until used A duplicate set of samples were prepared without 35 S- 
labeling. 

For SDS-polyacrylamide gel electrophoresis (PAGE) analysis of 
expression of hK2 protein in Sj9 cells, samples were added to sample buffer 
(U.K. Laernmli, Nature, 221 680 (1970)), heated at 95°C for 5 minutes and 

25 subjected to SDS-PAGE under reducing conditions. 

Northern blot analysis was routinely used to screen and isolate 
clonal recombinant baculoviruses expressing pphK2 or mhK2 mRNA. A 
comparison of the corresponding lanes in both autoradiographs of the 
Northern blot and photographs of ethidiurn bromide staining of RNA shows 

30 that mRNA for pphK2 or mhK2 was present in recombinant virus infected Sfi 
but not in wild type virus-infected cells. Moreover, each of the pphK2 or 
mhK2 mRNA positive lanes represents RNA isolated from Sf) ceils infected 
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with recombinant viruses derived from a single viral plaque. Thus, the results 
suggest that high frequency (100%) of recombinant baculovirus containing 
either pphK2 or mhK2 was obtained from the above cotransfection. 
Furthermore, the sequences of pphK2 or mhK2 expressed in viral infected S/9 
5 cells were confirmed by a combination of RT-PCR, cloning and DNA 
sequencing. 

To determine whether the pphK2 protein is expressed in the 
insect cell Sfl, time course studies using 35 S-labeling of de novo synthesis of 
protein was performed and detected by SDS denaturing polyacrylamide gel 

10 electrophoresis (PAGE). As seen in the autoradiograph (Fig. 1), a unique 
protein (about 28 KDa) was found in pphK2-recombinant virus-infected S/9 
cells at 35-74 hour post-infection. This band was missing in uninfected cells 
or cells infected with wild type virus. The viral polyhedron protein (about 32 
KDa) was found (Fig. 1) as expected in S/9 cells infected with wild type 

15 virus, whereas it was not expressed by recorrMiant virus (Fig. 1). The 
protein was detected in cytosol when subcellular fractions (cytosol vs. 
insoluble fraction) was prepared by lysing cells with H 2 0 and freeze-thaw, 
whereas this 28 KD protein was detected in insoluble fraction when prepared 
by a detergent buffer and freeze-thaw (data not shown). 

20 Hie mhK2 protein was also expressed in the insect cell §9, 35 S- 

labeling of de novo synthesized protein was performed As seen in the 
autoradiograph (Fig. 2), a unique protein (about 28 KDa) was found in the 
insoluble fraction of mhK2-recornbinant virus-infected Sj9 cells at 48 hours 
post-infection. This band was missing in uninfected cells or cells infected 

25 with wild type virus. The viral polyhedron protein (about 32 KDa) was found 
in wild type virus-infected cells, whereas it was not expressed in cells infected 
with recombinant virus (Fig. 1). When the cytosol fraction was examined, no 
28 KDa band was observed. 

30 R F, Coli system 

Plasmid pBPPHK2 was transformed into E. coli BL21 (DE3) 
pLysS (Novagen, Inc., Madison, WI). This strain contains a chromosomal 
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copy of T7 RNA rx>lymerase under the control of inducible LaclIV5 
promoter. Upon addition of IPTG (isopropyl-(3-E)-u^ogalactcpyranoside) the 
expression of the T7 RNA polymerase is induced which in turn activates the 
T7 promoter resulting in overproduction of the gene product under control of 
5 this promoter. To determine whether the product of the ppHK2 gene would 
be expressed from pBPPHK2, single colonies of BL21 E. Coli transformed 
with pBPPHK2 were grown to O.D.^ = 0.2 in 10 ml LB media plus 
ampicillin (lOOugfal) and induced with 0.4 mM IPTG (Sigma, Inc.). Cells 
were harvested 2 hours after induction by centrifugarion and resuspended in 

10 1.5 ml SDS/PAGE sample buffer (UK. Laemmli, Nature. 227. 680 1970) 
before SDS/PAGE analysis. The cell pellet from the IPTG-induced culture 
was resuspended in 0.05M Tris, pH 8.0 (at 9ml/gm cell pellet) and stirred at 
room temperature (25°C, r.t) for 1 hour. Lysozyme (4 mgfail) was added to 
this suspension (at 1 ml/gm cell pellet) and the suspension was stirred at r.t 

15 for 30 min followed by incubation on ice for 30 mia The suspension was 
sonicated for 2 min at 150 watts and centrifuged at 3000xg to isolate the 
inclusion bodies. Inclusion bodies were resuspended in running buffer (25 
mM Tris, 192 mM glycine, 0.1% SDS) and after centrifugarion both the pellet 
and the supernatant were analyzed by SDS/PAGE 

20 About 90% of the pphK2 was found to be in the supernatant 

fraction which indicated that pphK2 is soluble in 0.1% SDS. To prepare 
samples for amino acid sequence analysis, 20ul of inclusion body lysate was 
subjected to SDS/PAGE on a 4-20% gradient gel (BIORAD, Inc., Melville, 
N.Y.). The protein was blotted from the gel onto 0.2u PVDF paper (BIO 

25 RAD) and stained with Coomassie blue. The protein band of interest was cut 
out from the blot and subjected to amino acid sequencing using a protein 
sequencer model 477A (Applied Biosystem, Inc., Foster City, CA). 

The induced cells overproduced large amounts of a polypeptide 
with apparent molecular mass of about 28kd (Figure 3). Densitometric 

30 analysis indicated that this protein comprised approximately 40% of total 
cellular protein. The size of this protein as determined by an SDS-PAGE gel 
was comparable to that predicted from coding sequence for pphK2. To 
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confirm that this protein is pphK2, the sequence of the first 10 amino acids 
(MWDLVLSIAL) (SEQ ID NO: 13) from the N-terminus was determined 
This sequence agrees perfectly with that deduced from the DNA sequence of 
pphK2 cDNA. As noted, it has different identity from the first 10 amino 
5 acids of both pphKl (MWFLVLCLAL) (SEQ ID NO: 14) and pphK3 
(MWVPWFLTL) (SEQ ID NO: 15). It also shows that this protein is not 
modified or processed at the N-terminus either during or after expression in E. 
coli. These results demonstrate that we were able to accurately express 
pphK2 in E. coli from pBPPHKZ 

10 

C Mammalian System 

1. Isolation and Purification of protein 

Plasmid pGThK2 was transformed into hamster cell line AVI 2- 
664 (ATCC-CRL-9595). To determine whether the product of the ppHK2 
15 gene would be expressed from pGThK2, AV12-pG1hK2 #2 was grown in 
D10F + 200nM MIX At about 60% confluency the cells were washed with 
Hank's balanced salt solution and resuspended in serum-free HH4 medium. 
The spent medium was collected after 7 days (serum-free spent medium) and 
stored at -20°C. Figure 8 depicts a SDS-PAGE confirming expression of 
20 recombinant pphK2 in a niarnmalian cell line. AV12-pGThK2 (Lane 4-6) and 
AV12-pGT-d (Lane 3) clonal cell lines were grown in D10F media. About 
300ul of spent medium from the above clones were concentrated and 
subjected to SDS/PAGE along with See Blue MW marker (lane 1) and pphK2 
lysate from E. coli cells (lane 2). The gel was blotted onto nitrocellulose 
25 paper and immunoblotted using a 1/1000 dilution of anti-pphK2 rabbit 

antiserum. HRP-goat anti-rabbit was used as the secondary probe and the blot 
was developed by DAB plus H^. Lane 3 (AV12-pGT-d) is AV12 
transfected with vector without insert 

To purify the protein, the serum-free spent medium was 
30 concentrated from 5-10 fold by ultrafiltration with a 10 kDa molecular weight 
cutoff membrane then dialyzed overnight at 4°C versus 50 mM sodium 
bicarbonate, pH 8. Samples were filtered with 0.2 u filters and then pumped 
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directly onto a TSK DEAE-5PW HPLC column (21 mm X 150 mm) at a flow 
rate of 5 mlVmia Buffer A contained 50 mM sodium bicarbonate, pH 7.9 
Buffer B contained 50 mM sodium bicarbonate plus 0.5 M sodium chloride, 
pH 7.6. The elution profile shown in Figure 9 was developed with a gradient 
5 from 0-50% Buffer B over 35 min; 50-100% B from 35-40 min and isocratic 
elution at 100% B for 5 min before re-equilibration in Buffer A- The flow 
rate was 5mL/min throughout 

DEAE fractions were assayed for the presence of hK2 by 
ELISA using rabbit anit-pphK2 as primary antibodies. The FT ISA assayed 

10 showed a peak of hK2 activity which eluted at approximately 0-2M NaCl 
(shown as the triangle line in Figure 9), which correlated well with the 
appearance of a 34 kDa band of protein seen by SDS-PAGE in the same 
fractions (data not shown). 

Fractions with hK2 activity were pooled and concentrated by 

15 ultrafiltration with 10 kDa membranes to approximately 5-8 mL where upon 
solid ammonium sulfate was added to make a final concentration of 1 M 
This sample was then injected onto a PolyLC. polypropyl aspartamide column, 
1000A pore size, 4.6 mm X 200 mm, to resolve protein by hydrophobic 
interaction chromatography (HIC, see Figure 10). Buffer A was 20 mM Na 

20 phosphate, 1.2 MNa sulfate pH 6.3 and Buffer B was 50 mM Na phosphate, 
5% 2-propanol, pH 7.4. The elution gradient was 0-20% B over 5 min; 20- 
55% B from 5-20 min, isocratic at 55% B from 20-23 min, 55-100% B from 
23-25 min; isocratic at 100% B for 2 min before re-equilibration Buffer A. 
The flow rate was 0.7 mL/rnia Greater than 90% of the was not 

25 retained on HIC column. The main peak retained on HIC, which eluted at 22 
min, also showed the highest peak of activity by ELISA assay (triangle line, 
Figure 10). 

HIC fractions which tested positive for hK2 on ELISA were 
pooled, ultrafilter concentrated as above to a volume less than 1 mL then 
30 injected on a 10/30 Pharmacia S12 size exclusion column equilibrated in 100 
mM ammonium acetate. The flow rate was 0.7 mL/mia When the 22 min 
peak from HIC was resolved by size exclusion chromatography, typically 
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about 80-90% of the protein eluted at 19.4 min, a retention time 
consistent with a protein of approximately 34 kDa (Figure 1 1). The only 
other protein peak on SEC, eluting at 16.7 min, corresponded to an about 
70KDa protein seen also in previous purification steps. 
5 To examine the efficiency of our purification scheme, 1 .5 ug of 

purified phK2 was subjected to SDS/PAGE in the presence or absence of (J- 
mercaptoethanol (BME), and the gel was stained with silver. Results showed 
that the phK2 in our sample was about 95% pure (Fig. 12). It also showed 
that pn>hK2 migrated at about 30 KD in the absence of BME, and it 
10 migrated at about 34 kDa in the presence of BME. This pattern is similar to 
that observed for the PSA purified from seminal fluid (Fig. 12). 

Recombinant phK2 is recognized by rabbit anti-pphK2, rabbit 
anti-PSA and a murine monoclonal antibody directed against a polypeptide 
covering amino acids 41-56 of hK2, when analyzed on WESTERN blots or 
15 when dried down on microtiter plates. However, phK2 was not detectable by 
these antibodies in sandwich assays. These results further demonstrate that 
the phK2 and PSA are conformationally different and the antibodies currently 
available to PSA or hK2 can not detect phK2 in its native form Furthermore, 
phK2 was not detectable by the Tandem R or free-PSA assays (immunological 
20 assays for detection of PSA in serum). 

A sample of the hybridoma (HK1A 523.5) secreting the murine 
monoclonal antibody has been deposited in the American Type Culture 
Collection, Rockville, MD, and assigned ATCC HB-11876. 

25 2. Amino Acid Analysis and Protein Sequencing of phK2 

Hie peak collected off size exclusion chromatography (SEC) in 
ammonium acetate was lyophilized to remove the buffer then reconstituted in 
water. An aliquot (2.5ug)of this sample was loaded on a Porton membrane 
(Beckman instruments) and subjected to automated N-terminal sequence 
30 analysis on an Applied Biosystems model 477A protein sequencer which 
yielded the following sequence: 
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Val-Pro-Loi-Ile-Gb-SCT-Arg-Ile-Val-Gly-Gly-Trp-Glu- (SEQ ID NO: 18). An 
aliquot of the same sample in water was also hydrolyzed in gaseous 6 N HQ 
under vacuum for 20 h at 112 °C men reconstituted in 0.1N HQ and analyzed 
on an Hewlett Packard Aminoquant amino acid analyzer utilizing pre-column 
5 derealization of amino acids with OPA for primary and FMOC for secondary 
amines. 

No competing sequence was evident from the profile of amino 
acids released sequentially by the Edman degradation procedure. By analogy 
to PSA this protein is pro hK2, since the known sequence of mature PSA has 
10 been shown to begin with Ileu-Val-Gly-etc and pro PSA has been postulated 
to have an extra 7 amino acids at the N-terminus. Amino acid analysis of this 
protein yielded an amino acid composition consistent with the recombinant 
sequence of prohK2. These results demonstrate that pphK2 was accurately 
expressed in the mammalian cell line AVI 2-664 from pGThK2. 

15 

Example 5. 

Production of antibodies to recombinant pphK2 

A. F. Coli System 

To prepare pphK2 for rabbit immunization, the inclusion bodies 
20 obtained from bacterial cultures of BL21 (pBpphK2) after IPTG induction as 
in Example 4B were resuspended in lOOul SDS/PAGE sample buffer/ml 
bacterial culture and electrophoresed on preparative SDS/PAGE The pphK2 
band was excised and electroeluted from the gel into running buffer (25 mM 
Tris, 192 mM glycine, 0.1% SDS) and used as the immunogea Two rabbits 

25 were each inimunized with lOOug of the immunogen in complete Freund's 
adjuvant and were boosted twice in three week intervals with lOOug of the 
immunogen in incomplete Freund's adjuvant and PBS, respectively. Rabbit 
anti-pphK2 sera was obtained one week following the second boost The 
presence of anti-pphK2 in the rabbit antiserum was shown by EUSA (data not 

30 shown). Once confirmed by this method, the highest titer antiserum was 
tested on Western blots using lysates from IPTG induced or non-induced 
cultures of BI.21 (pBpphK2). It was evident that the antiserum contained 
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antibodies highly specific for the pphK2 protein since a protein band at about 
28kd corresponding to pphK2 was recognized only in the induced lysate. Hie 
antiserum also recognized the purified pphK2 further showing the specificity 
of the antibodies to pphK2. The above data demonstrate that the antibodies 
5 recognize the prepro-form of hK2. 

To delineate if the antiserum recognizes the mature form of 
hK2 (mhK2), mhK2 was expressed in E. coli as a glutathione S-transferase 
fusion protein (GST-mhK2, 58kd), and the cell lysate was immunoblotted 
using anit-pphK2 rabbit antiserum It was evident that anti-pphK2 antiserum 

10 recognized the GST-mkK2, demonstrating that antibodies were at least in part 
against the mature region of pphK2. To examine the pattern recognized in 
seminal fluid by anti-pphK2 antibodies, seminal fluid was prepared from 
pooled semen as described by Sensabaugh and Blake, J. Urology. 142, 1523 
(1990), and immunoblotted with anti-pphK2 rabbit antiserum. The antiserum 

15 recognized a major band at about 34kd plus several minor bands at lower 
MW. The pre-immune serum did not recognize any bands in any of the 
above experiments, showing that the antibodies were generated by 
immunization. 

To determine whether there are any pphK2-specific antibodies 
20 in rabbit anit-pphK2 antiserum, the antibodies cross-reacting to PSA were 
absorbed out of the antiserum by a PSA affinity resin. Specifically, 1ml of 
the sera was diluted with ImL 100 mM HEPES, pE 7.5 and incubated with 
native PSA-bound Affigel-10 for 3.5 hours at 4°C. Hie mixture was used 
poured into a column, the flow-through was collected and the column was 
25 washed with 30 ml HEPES buffer. Antibodies bound to the column (eluate) 
were eluted by acetic acid (IN, pH 4.0) and neutralized to pR 6.6 with 
NH4OH. Native PSA was isolated from seminal fluid as described by 
Sensabaugh and Blake, cited above. ppPSA was purified from E. coli 
transformed with plasmid pPHS579 (containing ppPSA under control of T7 
30 promoter) using a procedure analogous to pphK2 purification. 

The flow-through and the column eluate were tested for Abs 
recognizing pphK2, ppPSA and native PSA (PSA isolated from seminal fluid) 
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using Western blot analysis. It was evident that antibodies in the untreated 
rabbit anti-pphK2 antiserum recognized all three proteins indicating that 
pphK2, ppPSA and seminal fluid-PSA share some similar epitopes. However, 
while the column eluate contained antibodies that recognized all three protein, 
5 the flow-throu^i contained antibodies that recognized only pphK2. This 
indicates that anti-pphK2 antiserum contains pphK2-specific antibodies and 
these antibodies can be isolated by PSA affinity absorption. This system 
enabled us to generate anti-pphK2 antibodies which recognize both pphK2 and 
mhK2. Thus, utilizing immunogenic and pure* recombinant hK2 protein, 
10 generate rabbit antiserum was generated which contains pphK2-specific 

antibodies, providing a valuable source for generating and screening for hK2- 
specific monoclonal antibodies. 

These examples describe the use of three heterologous 
expression systems (i.e. both prokaryotic and eukaryotic) for the successful 
15 expression of the hK2 polypeptide. Thus, the method of the invention enables 
production of large quantities of substantially pure hK2 polypeptide. The 
polypeptide can be used both to study its biological functions and to produce 
immunodetection reagents such as labelled hK2 polypeptide, labelled 
fragments thereof and antibodies thereto. The immunoreagents can provide a 
20 method to purify native hK2 and to study the properties of the purified native 
hK2 polypeptide. 

The pphK2 overproduced in E coli can be readily solubilized 
in 0.1% SDS, thus solubility is not a problem. This is in contrast to the 
expression of human salivary kallikrein protein, hKl, in E. coli, which was 
25 found in insoluble inclusion bodies (J. Wang, et al Biochem. J., 226, 63 
(1991)). In contrast, the present invention yields almost pure protein which 
can be purified to homogeneity by rjreparatrve SDS-PAGE. This purified 
recombinant pphK2 can be used for the generation of monoclonal and 
polyclonal antibodies. 
30 As shown above, Baculogold viral DNA can be used to 

generate a recombinant baculovirus containing pphK2 or mhK2. Use of 
Baculogold viral DNA provides high selection of positive recombinant 
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baculoviiuses. Indeed, Northern blot analysis showed a high frequency of 
recombinant virus expressing pphK2 or mhK2 rnRNA Moreover, SDS-PAGE 
analysis showed that both pphK2 and mhK2 recombinant viruses produced 
unique proteins with sizes similar to the calculated molecular weights for 

5 pphK2 or mhK2. Although the levels of the recombinant hK2 expressed in 
insect system may not be as high as in E. coli, the hK2 protein produced in 
baculovirus-insect system may contain the secreted form which would be 
more like the natural form of the protein. 

Plasmids pphK2/pVL1393 in E. col. H13101 has been 

10 deposited in the American Type Culture Collection, Rockville, MD, USA on 
May 2, 1994 under the provisions of the Budapest Treaty and have been 
assigned accession number ATCC 69614. 

The invention has been described with reference to various 
specific and preferred embodiments and techniques. However, it should be 

15 understood that many variations and modifications may be made while 
remaining within the spirit and scope of the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Mayo Foundation for Medical Education 
and Research 
Hybritech Incorporated 
Tindall, Donald" J. 
Young, Charles Y.F. 
Saedi, Mohanmed S. 

(ii) TITLE OF INVENTION: Recombinant HK2 Polypeptide 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) AD DRESS EE: Schwegman, Umdberg & Woessner, P. A. 

(B) STRE ET: 3500 IDS Center 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) COUNTRY: USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Raasch, Kevin W. 

(B) REGISTRATION NUMBER: 35,561 

(C) REFERENCE/DOCKET NUMBER: 150.148W01 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-339-0331 

(B) TELEFAX: 612-339-3061 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

He Val. Gly Gly Trp Glu Cys Glu Lys His 
15 10 

Leu Val Ala Ser Arg Gly Arg Ala Val Cys 
20 25 

Pro Gin Trp Val Leu Thr Ala Ala His Cys 
35 40 

He Leu Leu Gly Arg His Ser Leu Phe His 
50 55 

Val Phe Gin Val Ser Thr Ser Phe Pro His 
65 70 

Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly 
85 90 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala 
100 105 

Lvs Val Met Asp Leu Pro Thr Gin Glu Pro 
115 120 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro 
130 135 

Lys Lys- Leu Gin Cys Val Gin Leu His Val 
145 150 

Ala Gin Val His Pro Gin Lys Val Thr Lys 
165 170 

Arg Trp Thr Gly Gly Lys Ser Thr Cys Ser 
180 185 

Leu Val Cys Asn Gly Val Leu Gin Gly He 
195 200 

Pro Cys Ala Leu Pro Glu Arg Pro Ser Leu 
210 215 

Tyr Arg Lys Trp He Lys Asp Thr He Val 
225 230 



Ser Gin Pro Trp Gin Val 
15 

Gly Gly Val Leu Val His 
30 

He Arg Asn Lys Ser Val 
45 

Pro Glu Asp Thr Gly Gin 
60 

Pro Leu Tyr Asp Met Ser 
75 80 

Asp Asp Ser Ser His Asp 
95 

Glu Leu Thr Asp Ala Val 
110 

Ala Leu Gly Thr Thr Cys 
125 

Glu Glu Phe Leu Thr Pro 
140 

He Ser Asn Asp Val Cys 
155 160 

Phe Met Leu Cys Ala Gly 
175 

Gly Asp Ser Gl^ Gly Pro 

Thr Ser Trp Gly Ser Glu 
205 

Tyr Thr Lys Val Val His 
220 

Ala Asn Pro 
235 
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(2) INFORMATION FOR SBQ 3D NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AOGOGGATCC AGCAXGTGGG ACCTOGTTCT CT 32 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACAGCTGCAG TTTACTAGAG GTAGGGGTGG GAC 33 



(2) INFORMATION FUR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
ACOOGAATTC ATGATTGTGG GAGGCTGGGA GKJT 



34 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10,. 792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCCAGC ATG TGG GAC CTG GTT CTC TCC ATC GCC TTG TCT GIG GGG 48 
Met Trp Asp Leu Val Leu Ser lie Ala Leu Ser Val Gly 
15 10 

TGC ACT QGT GCC GTG CCC CTC ATC CAG TCT OGG ATT CTG GGA GGC TGG 96 
Cys Thr Gly Ala Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp 
15 20 25 

GAG TCT GAG AAG CAT TCC CAA CCC TGG CAG GIG GCT GIG TAC AGT CAT 144 
Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His 
30 35 40 45 

GGA TGG GCA CAC TGT GGG GGT CTC CTG GTG CAC CCC CAG TGG GIG CTC 192 
Gly Trp Ala His Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu 
50 55 60 

ACA GCT GCC CAT TGC CIA AAG AAG AAT AGC CAG CTC TGG CTG GGT GGG 240 
Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg 
65 70 75 

CAC AAC CTG TIT GAG CCT GAA GAC ACA GGC CAG AGG CTC CCT GTC AGC 288 
His Asn Leu Hie Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser 
80 85 90 

CAC AGC TTC CCA CAC COG CTC TAC AAT ATG AGC CTT CTG AAG CAT CAA 336 
His Ser Phe Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin 
95 100 105 

AGC CTT AGA CCA GAT GAA GAC TCC AGC CAT GAC CTC AUG CTG CTC CGC 384 
Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg 
110 115 125 125 

CTG TCA GAG OCT GCC AAG ATC ACA GAT CTT GTG AAG GTC CTG GGC CTG 432 
Leu Ser Glu Pro Ala Lys lie Thr Asp Val Val Lys Val Leu Gly Leu 
130 135 140 

CCC ACC CAG GAG OCA GCA CTG GGG AOC AOC TGC TAC GCC TCA GGC TGG 480 
Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp 
145 150 155 

GGC AGC ATC GAA CCA GAG GAG TTC TTG OGC CCC AGG AGT CTT CAG TGT 528 
Gly Ser He Glu Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys 
160 165 170 
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GTG AGC CTC CAT CTC CIG TCC AAT GAC ATG TGT GCT AGA GCT TAG TCT 
Val Ser Leu His Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser 
175 180 185 

GAG AAG GIG ACA OG TTC ATG TIG TGT GOT GGG CTC TGG ACA GOT GOT 
Glu Lys Val Thr Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly 
190 195 200 205 

AAA GAC ACT TOT GGG GOT GAT TCT GGG GOT OCA CTT OTC TCT AAT GOT 
Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly 
210 215 220 

GIG CTT CAA GOT ATC ACA TCA TGG GGC CCT GAG CCA TOT GCC CIG COT 
Val Leu Gin Gly He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro 
225 230 235 

GAA AAG COT GOT GIG TAC ACC AAG GIG GTG CAT TAC OGG AAG TGG ATC 
Glu Lys Pro Ala Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp He 
240 245 250 

AAG TAC ACC ATC GCA GCC AAC CCC TGAGIGOOOC TCTOOCAOOC CTACCTCTPG 
Lys Tyr Thr He Ala Ala Asn Pro 
255 



260 



576 



624 



672 



720 



768 



822 



TAAACIGCAG 



832 



(2) INFORMATION FOR SBQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Trp Asp Leu Val Leu Ser He Ala Leu Ser Val Gly Cys Thr Gly 
15 10 15 

Ala Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu 
20 25 30 

Lys His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala 
35 40 45 

His Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala 
BO 55 60 

His Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu 
65 70 75 80 

Phe Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe 
85 90 95 

Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg 
100 105 110 
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Pro Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu 
115 120 125 

Pro Ala Lys lie Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin 
130 135 140 

Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser lie 
145 150 155 16^ 

Glu Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu 
165 170 175 

His Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val 
180 185 190 

Thr Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr 
195 200 205 

Cvs Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin 
210 215 220 

Glv lie Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro 
225 230 235 240 

Ala Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp lie Lys Tyr Thr 
245 250 255 

He Ala Ala Asn Pro 
260 



(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATICN: 7. .720 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATTC ATG ATT GTG GGA GGC TGG GAG TGT GAG AAG CAT TCC CAA COC 48 
Met He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro 
1 5 10 

TGG CAG GTG GCT GTTG TAC AGHT CAT GGA TGG GCA CAC TGT GGG GGT GTC 96 
Tr^ Gin Val Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val 

CTG GIG CAC COC CAG TGG GTG CTC ACA GCT GCC CAT TGC CTA AAG AAG 144 
Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys 
35 40 45 
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AAT AGC CAG GTC TOG CIG OCT OGG CAC AAC CIG TIT GAG CCT GAA GAC 
Asn Ser Gin Val Tip Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp 
50 55 60 

ACA GGC CAG AGG GTC CCT GTC AGC CAC AGC TTC CCA CAC COG CTC TAC 
Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr 
65 70 75 

AAT A3G AGC CTT CTG AAG CAT CAA AGC CIT AGA CCA GAT GAA GAC TCC 
Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser 
80 85 90 

AGC CAT <3C CTC AIG CTG CTC GGC CTG TCA GAG CCT GCC AAG ATC ACA 
Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys He Thr 
95 100 105 110 

GAT GTT CTG AAG GTC CTG GGC CIG CCC ACC CAG GAG CCA GCA CTG GOG 
Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly 
115 120 125 

ACC ACC TGC TAC GCC TCA GGC TOG GGC AGC ATC GAA OCA GAG GAG TTC 
Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Hie 
130 135 140 

TIG OGC CCC AGG AGT CTT CAG TGT GIG AGC CTC CAT CTC CTG TCC AAT 
Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn 
145 150 



155 

GAC AIG TGT GCT AGA GCT TAC TCT GAG AAG CTG ACA GAG TTC ATG TIG 
Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu 
160 165 170 

TGT GCT GGG CTC TOG ACA GCT GCT AAA GAC ACT TCT GGG GCT GAT TCT 
Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser 
175 180 185 190 

GGG GCT CCA CTT GTC TCT AAT GCT CTG CTT CAA GCT ATC ACA TCA TGG 
Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp 
1 1 195 200 205 

GGC OCT GAG CCA TCT GCC CIG OCT GAA AAG CCT GCT GIG TAC ACC AAG 
Glv Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys 
1 210 215 220 

GIG GIG CAT TAC GGG AAG TGG ATC AAG TAC ACC ATC GCA GCC AAC CCC 
Val Val His Tyr Arg Lys Trp He Lys Tyr Thr He Ala Ala Asn Pro 
225 230 235 

TGAGIGCCCC TGTCCCACCC CTACCrCTAG TAAACTGCAG 
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(2) INFORMATION PQR SBQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin 
15 10 15 

Val Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val 
20 25 30 

His Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser 
35 40 45 

Gin Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly 
50 55 60 

Gin Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met 
65 70 75 80 

Ser Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His 
B5 90 95 

Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys He Thr Asp Val 
100 105 110 

Val Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr 
115 120 125 

Cys Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Arg 
* 130 135 140 

Pro Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met 
145 150 155 160 

Cvs Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala 
1 lis 170 175 

Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly 
1 180 185 190 

Pro Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Pro 
195 200 205 

Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val 
210 215 220 

His Tyr Arg Lys Trp He Lys Tyr Thr He Ala Ala Asn Pro 
225 230 235 
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(2) INFORMATION FOR SBQ 3D NO: 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCJIH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(ix) FEAIURE: 

(A) NAME/KEY: CDS 

(B) L0CA3TCN: 1..732 



96 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GIG CCC CTC ATC CAG TCT CGG AIT GIG GGA GGC TOG GAG TGT GAG AAG 48 
Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu Lys 
15 10 15 

CAT TCC CAA CCC TGG CAG GIG GCT GIG TAC ACT CAT GGA TOG GCA CAC 
His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His 
20 25 30 

TCT GGG GGT GTC CIG GIG CAC O0C CAG TGG GTG CTC ACA GCT GCC C3T 144 
Cvs Glv Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His 
1 1 35 40 45 

TCC CIA AAG AAG AAT AGC CAG GTC TCG CTG GZT CGG CAC AAC CIG TTT 192 
Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe 

GAG CCT GAA GAC ACA GGC CAG AGG GTC CCT GTC AGC CAC AGC TTC CCA 240 
Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro 
65 70 75 80 

CAC CCG CTC TAC AAT AIG AGC CTT CIG AAG CAT CAA AGC CIT AGA CCA 288 
His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro 

GAT GAA GAC TOC AGC CAT GAC CTC ATG CIG CTC OGC CTG TCA GAG OCT 336 
Asn Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro 
* * 100 105 110 

GCC AAG ATC ACA GAT GOT GIG AAG GTC CIG GGC CIG COC AGC OG GAG 384 
Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu 
115 120 125 

CCAGCACIGGGGACCAOCTSCTACGCC TCA GGC TGG GGC AGC ATC GAA 432 
Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu 
130 135 140 

CCA GAG GAG TTC TIG 03C OCC AGG ACT CTT <3\G TGT GIG AGC CTC CAT ' 480 
Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His 
145 150 155 160 

CTC CIG TCC AAT C3\C AIG TGT GCT AGA GCT TAC TCT GAG AAG GIG ACA 528 
Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr 
165 170 175 
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GAG TTC ATG TIG TGT GCT GGG CTC TGG ACA GGTT GGT AAA GAC ACT TUT 576 
Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys 
180 185 190 

GGG GGT GAT TCT GGG GGT CCA CTT GTC TUT AAT GGT GIG CTT CAA GGT 624 
Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly 
195 200 205 

ATC ACA TCA TGG GGC CCT GAG CCA TGT GCC CTG CCT GAA AAG OCT GCT 672 
He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala 
210. 215 220 

GTG TAG ACC AAG CTG GIG CAT TAC OGG AAG TGG ATC AAG TAC ACC ATC 720 
Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp He Lys Tyr Thr He 
225 230 235 240 

GCA GCC AAC CCC TGAGTGCCCC TGTOCCACCC CTACCTCTAG TAAA 766 
Ala Ala Asn Pro 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu Lys 
15 10 15 

His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His 
20 25 30 

Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His 
35 40 45 

Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe 
50 55 60 

Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro 
65 70 75 80 

His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro 
85 90 95 

Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro 
100 105 110 

Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu 
115 120 125 

Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu 
130 135 140 

Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His 
145 150 155 160 
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Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr 
165 170 175 

Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys 
180 185 190 

Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly 
* 195 200 205 

lie thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala 
210 . 215 220 

Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp lie Lys Tyr Thr lie 
225 230 235 240 

Ala Ala Asn Pro 



(2) INFORMATION FOR SEQ 3D NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAIACATAIG T3GGAOCIGG TTCTCTOC 28 



(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: cENA 



(xi) SEQUENCE DESCRIPTION: SEQ 3D NO: 12: 
ATAIGGATCC TCAGGGGITG GCTGC3GRIGG T 



31 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC7IH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Trp Asp Leu Val Leu Ser lie Ala Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Trp- Phe Leu Val Leu Cys Leu Ala Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Trp Val Pro Val Val Phe Leu Thr Leu 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
1 5 10 15 

Ala Val Trp Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val His 
20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin 
35 40 45 

Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin 
50 55 60 

Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 80 

Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp 
85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys lie Thr Asp Val Val 
100 105 110 

Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Arg Pro 
130 135 140 

Arq Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met Cys 
145 150 155 160 

Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala Gly 
a 165 170 175 

Leu Trp Thr Gl£ Gly Lys Asp Thr Cys Gly Gly Asp Ser Gl£ Gly Pro 

Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Pro Glu 
195 200 205 

Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arq Lys Trp He Lys Asp Thr He Ala Ala Asn Pro 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEN3IH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 
ATATGGATCC ATAIGTCAGC ATCTGGGACC TOjITCTCTC CA 42 



(2) DJP0RMA3TCN FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu 
15 10 
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WHAT IS CLAIMED IS: 

1. Isolated, substantially homogeneous pre-pro hK2 polypeptide. 
5 2. Isolated, substantially homogenous pro hK2 polypeptide. 

3. Isolated, substantially homogenous mature hK2 polypeptide. 

4. An antibody that is capable of specifically binding the hK2 polypeptide 
10 of claims 1, 2 or 3 and which does not bind to hK3. 

5. The antibody of claim 4 which is a monoclonal antibody. 

6. A hybridoma cell line producing the antibody of claim 5. 

15 

7. An isolated nucleic acid molecule encoding the polypeptides of claim 
1, 2 or 3. 

8. An isolated nucleic acid molecule selected from the group consisting 
20 of 

(a) cDNA comprising the nucleotide sequence of the coding region 
of the hK2 gene; 

(b) DNA capable of hybridizing under stringent conditions to a 
nucleotide sequence complementary to the nucleotide sequence 

25 of (a); and 

(c) a genetic variant of any of the DNA of (a) and (b) which 
encodes a polypeptide possessing an antigenic function of 
naturally occurring hK2 polypeptide. 



30 9. 



The nucleic acid molecule of claim 8 further comprising a promoter 
operably linked to the nucleic acid molecule. 
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10. A chimeric expression vector comprising the nucleic acid molecule of 
claim 7 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

5 11. The vector of claim 10 wherein the host cell is E. coli . 

12. The vector of claim 10 wherein the host cell is a mammalian cell. 

13. A host cell transformed with the vector of claim 10. 

10 

14. The host cell of claim 13 which is E_coJi. 

15. Tie host cell of claim 13 which is mammalian. 

15 16. A method of using a nucleic acid molecule encoding a hK2 

polypeptide comprising expressing the nucleic acid molecule of claim 
7 in a cultured host cell stably transformed with a chimeric vector 
comprising said nucleic acid molecule operably linked to control 
sequences recognized by the host cell transformed with the vector, and 

20 recovering hK2 polypeptide from the host cell 

17. The method of claim 16 wherein the host cell is E^j^li. 

18. The method of claim 16 wherein the host cell is mammalian. 

25 

19. The method of claim 16 wherein the nucleic acid molecule is DNA 

20. The method of claim 16 wherein the hK2 polypeptide is recovered 
from the host cell culture medium. 
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1 GaVXCAGCATGTGGGACCTGCTITCTCTCC^^ 
CCTAC<nCGTAC*£CC7C^CCAAGAaAG^ 

1> MetT rpAspLegVal LouSer ! I eAI aLeuSer Va I GJ yCysThr G1 yAl aVa I Pr 

66 CCTCATCCAGTCTCCX2^ 

GGAGTAGGTCAGAGCCTAACACCCTCCG^^ 
19> oLeul I eGl nSer Argl i eVal GI yGI yTrpGl uCysGI uLysHi sSer^GI nProTrpGI nValA 

131 CTt^H'ACAGTCATSC-ATGGGCACACTG^^ 
GAC^TCH^GT^CTACCC^^ 
41* I aVal TyrSer Hi sGlyTrpAl aHi sCysGlyGl yVa ! LeuVa I Hi sProGI nT rpVal LeuThf 

196 CXTIX^C CATTGCCTAAAGMGAATAGC C^GGTCT^KTr vGCTCC-GCACAACL'^'o ITi VSAQCCTGA 
CGA.CGGGT AAC^3GATTTCTTCTTA7CC^rrC CAGACCGACCCVXC C TtTTTGG ACAAACTCGGACT 
63* Al aAl aHi sCysleulysLysAsnSer Gi nVa I T rpLeuGl yA rgHl sAsnLeuPheGI uProGI 

261 Aa^CACl^GGCOVGA^GGTCCCT^ 

TCTGT^CGG*XTCCCAGGGAC^GrCGGT^^ 
84> uAspThf GI yGI nArgVa t ProVa I Ser Hi sSer PheProHi sProLeuTyrAsnMat Ser LeuL 

326 7C^ACCA?C^^GCCT?AGACC^ 

ACTTCCn'AGTTTCOaA^TCTGGTCTACTTCT^ 
106> euLy sHi sGI nSer LeuArgProAspGI uAspSer Ser Hi sAspLeuMa l LeuLeuArgLeuSer 

391 CAgCCTGCCAAGATCACACA' TG Tl V i ■ ^ aG7CCTGGGCC? CC CCACCCA^AGCCAC^C?g^ 
. CTCGCIACGGTTCrA£n\7rCTACAACACT7CCAGC^C CCGGACCXK7TGGGTCC7CGGTCGTGACCC 
128> GI uProAl aLysl I eThrAspVal Vat LysVal LeuGI yLeuProThr GI nGI oP roAl aLeuG! 

456 GACCACCTGCTAC£CC?CA«JC7X^^ 

CTGGTGGACGATGCGGAGTCCGACCCCG7CCTAGCT7GGTCTC 
149> yThr Thr CysTyrAI aSer GI yTrpGIySer 1 1 eGI uProGI uG! uPtieLeuArgProArgSer L 

521 TTaAGT<7ISJI^GCCTC 

AAGTCACACACTCGOAGGTAa^GGACAC^ 
171> euGI nCysVa I Ser LeuHi sLeuLeuSerAsnAspMa t CysAI aArgAI aTy t Ser GI uLy $Va I 

586 ACACACTTCATGTrGTOTXTGGGCTC^ 

TGTCTCAACTACAAC^CACGACCXGAGACCT^ 
193> Thr GI uPhaMa t LeuCysAI aGl yteuT rpThr GI yGI yLysAspThr CysGI yGI yAspSer 61 

651 GGGTCC^ Cn^ T C T olM TCOT^ 

CCCAGGT , l J ^ACAGACA7TACCACACGAA<nTCCAT^ 
214* yGI y ProLeuVa I CysAsnGI yVal LeuGI nGI y 1 1 eThr Ser T rpGI y ProGI uP roCysAl aL 

716 T^CTavAAOcaoc?CT<rr^^ 

ACGGACTTTTCG^GACACATGTOGTTCC^^ 
236> euProGI uLysProAl aValTyrThr LysVal Val HI sTyrArgLysTrpI I eLysTyrThr 1 1 e 

781 GCAGCC;^CCCC7GAC??GCCCCTGT^ 

CJTCCCTTGGGaaCTCACXCaACAGGGTGG^^ 
258>AlaAlaAsnPro 
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1 G^TT^TG-.TTGTC^A^^ 

1> ^tll eVa I GI yGI yTrpGI uCysGI uLysHi sSer Gl nPio 

ACCGTCC-^CCACACl-.TGTCAGTACCTACCCGTCTa^CACCCCCACAG 
15> TrpGlnValAI aVal TyrSer Hi sGlyTrpAl aHi sCysGlyGI yVal 

31> LeuVa I Hi sProGI nT rpVa I LeuThr Al aAl aHi sCysLeuLy sLys 

14 5 AATACCCJV^TCTTC^^^^ 

T7A7CG^i\:C^GACCGACCCA^^ 
47 ► AsnSer Gl nVal T rpLeuGI yA rgHi sAsnLeuPheGI uPf oGI uAsp 

193 ac^goccagac<:^ccctg?cv^ 

7Grrv;CGG7C7CCC-.GC^A^ 
63>Thr Gl yGI nArgVal ProValSer HisSer PheP roHr s ProLeuTyr 

241 AA7A7CACXrCT7CI^AAG£^7CAAAGCC77AGAC^ 

77ATAC T'C C-GAAGA C 77C G7A GTT7C GG A^ TCI GGTC T ACTTC TG AGG 
79 ► AsnMst Ser LeuLeuLy sHi sGf nSer LeuAr gProAspGI uAsp&sr 

2S9 AGCC^iTCttCCTCATCXr^ 

7CC<?7AC7GGAG7ACGACGAC^CC^CAG7C7CC^ 
95* Ser Hi sAspLeuMe t LeuLeuArgLeuSer Gl uProAl aLy s I I eThr 

337 GATG7TGTa^C<rrce7CGCCCTC^C^ 

CTACAACACTTCG\GGACCCG3A^^ 
lll^AspVal Val LysVal LeuGI yLeuProThr GI nGI uP roAl aLeuGly 

385 ACC\CCTGCTACCOC7CnGC^ 

TC-GTGGACGATG^GGAGTGCaVTCCCGTCGTAC^^ 
127>ThrThrCysTyrAlaSer GlyTrpGlySer 1 1 eGluProGluG! uPhe 

433 TTGCGCCCCVC^WnCITCAff 
AACGCGrjC^rreCTC^G/^Vora 
143 ► LeuArgProArgSer LeuGI nCysVal Ser LeuHi sLeuLeuSerAsn 

481 GAC-.7G7G7GCTACAGC7TAC7C7GAG^ 

C7GTAC^CtCGA7GTCC^A7G^GAC:Cri\:^C:^TC7CiAGrACAAG 
159>AspMetCysAI aArgAI aTyrSer Gl uLysVal Thr Gl uPheMetLeu 

529 TGTGCPGGG:nC?Ga^CV:-^ 

ACACGftCCCGAGACCTOTCC\CCA7^^ 
175> CysAI aGI yLeuT rpThr Gl yGI yLysAspThr CysGI yGI yAspSer 

577 GGGGGTCCACTTGTCTGTAATGGTGTQCT?^ 

CCCCCAGG?<^CAC^ATra 
191> Gl yGI yP roLeuVa I CysAsnGI yVa I LeuGI nGI y 1 1 eThr Ser T rp 
625 GGCCC7GAGCc^7:rrorcc7G::cTaw 

CCGGG^CTCGGTACACGGGACGGACTTTTCGGACGA^ 
207> GlyProGI uProCysAI aLeuProGI uLysProAl aVa I Ty rThr Lys 

673 GTC<rrGGA7TACCG.GAAG7C^.TCAAG7ACACCA7CC-CA.GCCAACCCC 

223 ► Val Val Hi sTyrArgLy sT rp 1 1 eLysTyrThr 1 1 eAI aAl aAsnPro 

721 7^GA<GTGCCCCTC7CCCACCC'7TACC^CTA^T>V^CTG-Z-iiG 
ACTCACGC^GACAGGGTGC-GGATC^.GATC^TC 
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1 GTGCCCCTCATCC^GTCTCGGATT^ 

CACGC^<W7TACX?IK^G^GCCTAA(^ 
l>Val ProLeul leGlnSorArgl t eVal GI yGI yTrpGI uCysGI uLysHi sSer GI nPro 

61 ?S<K*JGGTGGCTGTC7rACAG?CAT^ 

ACOrTTCCACCGACACYIKnO^ACCT^ 
21>TrpGl nValAlaValTyrSerHisGiyTrpAleHisCysGIyGlyVaHouValHisPro 

121 aiGTGGGTGCTCACACCjT^ 

ClXrACCCACGAGTGTCGACGarr^.CCGA'I 1 1 1 ^' I-C VI ATCGCTCCAGACCGACCCAOCC 
41> GI nT rpVa t LeuThr Al sAI aHi s CysLeuLy sLysAsnSer G! nVa I T rpLeuGI yA rg 

181 CACAACCTCHTTGASCCTGAAG&O.CAC^^ 

gtgit<x^caaactcc-:actt^^ 

61> Hi sAsnLeuPheGl uProGI uAspThr GI yG 1 nArgVa I ProVa I Sar Hi sSer PhePro 

241 O^CCGCTCTACAATATC^GCCTr^ 

GTGGGCGAGATGTTATACTCGGA^GACTICGTAGTTICGGAATCT^ 
81> Hi sProLeuTyrAsnMetSer LeuLeuLysHi sGI nSer LeuArgProAspGI uAspSer 

301 AGCCAT^CCTCWGCTCXTCCGCC^^ 

7CGGTACTGG\G?ACGACGAGGCGGACAGTCTO 
101> Ser HI sAspLeuMe t LeuLeuAf gLeuSer G! uP roAl aLysl I eThr AspVa I Vatlys 

361 GTCCTGGGCCTTCCCAC<raGGAGCCAC<^^ 

CAGGACCCGGACGGGTGGGTCCTCGGTCGT^CCCCT^ 
121> Va I LeuGI yLeuP roThr GI nGI uP roAl aLeuGI yThr Thr CysTy r Al a Ser GI yT rp 

42 1 GCXftGCMCGtoCCMAG^ 

CCGTCGTAGCTTG<nC?CCTCAAG- £ ACC^ 
141> GI ySer 1 1 eGl uP roGI uGl uPheLeu ArgProArgSer LeuGI nCyeVa 1 Ser LeuHi s 

481 CTCCTGTCC^ATGACA?GI<?r^3CTA 

GAGGACAGGTTACTGTACACACGATCTCGAA^^^ 
161* LeuLeuSer AsnAspMsl CysAI a ArgAI aTy rSer GI uLysVa I Thr GI uPheMet Leu 

541 TGTGCTGGGCTCTGoAC^GGTGGTAAAGACACTT^ 

ACAG^CCGAGACCTCnCCACCAT^ 
181* CysAl aGI yLeuT rpThr GI yGI yLysAspThr CysGI yGI yAspSer GI yG I yProLeu 

601 GTCTGWATSCTGTGCTTC^JSGT^^ 

CAGACATTACCACACGAA5T7CCA?AGTGTAG7ACCCCGGGACTCGCT^ 
201> Va I CysAsnGI yVa I LeuGI nGI y 1 1 eThr Ser T rpGl yProGi uProCysA! aLeuPro 

661 GAAAAGOCTGCTGTCTACAC^^ 

CTTirCGa^CGACACATCTGGTTCCAC^^^ 
221* GI uLysProAl aVa I TyrThr LyaVal Val HI sTyrArgLysTrpI I eLycTyrThr 1 1 e 

721 GCAGCCAACCCCTGAG?GCCCCrG?CCCACCCCTACCTCT 

CCTTCGCTITGCGGACTC^CCGGr^AC^ 
241>AlaAt aAsnPro 
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Size Exclusion Chromatography 
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RECOMBINANT HK2 POLYPEPTIDE 

5 Background of the Irwention 

Hie glandular kallikreins are a subgroup of serine proteases 
which are involved in the post-translational processing of specific polypeptide 
precursors to their biologically active forms. The human kallikrein gene 
family consists of three members: prostate-specific antigen, human glandular 

10 kallikrein, and pancreatic/renal kallikrein See J.A. Clements, Rndocr, Rev.. 
ID, 393 (1989) and T.M Chu et al. (U.S. Patent No. 4,446,122). A common 
nomenclature for these members of the tissue (glandular) kallikrein gene 
families was recently adopted by T. Berg et al., in Recent Progress nn Kinins: 
Biochemistry a nd Molecular Biology of the Kallikrein-Kinin System. AgSDlS 

1 5 and Actions Su pplements. Vol. L R Fritz et al.., eds., Birkkauser Verlag, 
Basel (1992), and is defined in Table I, below. 

TABLE 1 
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The DNA sequence homology between hKLK2 and hKLK3 
(exon regions) is 80%, whereas the homology between hKLK2 and hKLKl is 
65%. The dediired amino acid sequence homology of hK2 to hKl is 57% 
Amino acid sequences deduced by LJ. Schedlich et al., DNA. 6, 429 (1987) 

5 and BJ. Moms, riin F.xp Pharmacol. Phvsiol. 16, 345 (1989) indicate that 
hK2 may be a trypsin-like serine protease, whereas hK3 (PSA) is a 
chymotrypsin-like serine protease. Therefore, if hK2 is indeed secretory, it 
may have a different physiological function than hK3. 

The hKLK2 gene is located about 12 kbp downstream from the 

10 hKLK3 gene in a head-to-tail fashion on chromosome 19. (P.H. Riegman et 
al., FF.RS TiaL. 247. 123, (1989)). The similarities of gene structure and 
deduced amino acid sequences of these human kallikreins suggest that their 
evolution may involve the same ancestral gene. Most interestingly, as 
reported by Morris, cited supra; P. Chapdelaine, FEBS Lett. 236. 205 (1988); 

15 and Young, Biochemistry. 31. 1952 (1992), both hK2 and MG may be 
expressed only in the human prostate, while expression of hKl is limited to 
the pancreas, submandibular gland, kidney, and other nonprostate tissues. 

Tremendous interest has been generated in hK3 (PSA) because 
of the important role it plays as a marker to detect and to monitor progression 

20 of prostate carcinoma. Its usefulness as a marker is based on the elevated 
serum concentration of circulating hK3 proteins which are frequently 
associated with prostatic cancer. The serum concentration of hK3 has been 
found to be proportional to the cancer mass in untreated patients, but is also 
proportional to the volume of hyperplastic tissue in patients with benign 

25 prostatic hyperplasia (BPH). The serum levels of hK3 become reduced 
following prostate cancer therapy. 

Despite the information which can be ascertained about hK2 
from the genomic DNA sequence, very little is known about the hK2 
polypeptide itself. The reason for this is that the protein has not been purified 

30 and characterized Thus, a need exists for a method to obtain hK2 
polypeptide and related polypeptides in sufficient quantity and purity for 
characterization and for use as therapeutic/diagnostic agents or reagents. 
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Summary of the Invention 
The present invention provides an isolated, substantially 
homogenous hK2 polypeptide. As used herein, in the term "hK2 polypeptide" 
includes pre-pro WC2, pro hK2 and mature hK2 polypeptides. Pre-pro hK2 is 

5 secreted by the cell in vivo, and is cleaved during secretion to yield pro hK2. 
which is then enzymatically cleaved in the extracellular environment to yield 
"mature" hK2. Most preferably, the hK2 polypeptide is contiguous in amino 
acid sequence with SEQ ID NO. 16, SEQ ID NO: 6 or SEQ ID NO: 10. 

The present invention also provides isolated nucleic acid 

10 molecules encoding hK2 polypeptide, including (a) a cDNA molecule 

comprising the nucleotide sequence of the coding region of the hK2 gene; (b) 
a DNA molecule capable of hybridizing under stringent conditions to a 
nucleotide sequence complementary to the nucleotide sequence of (a); and (c) 
a genetic variant of any of the DNA molecules of (a) and (b) which encodes 

1 5 of polypeptide processing an antigenic function of naturally occurring hK2 
polypeptide. Preferably, the nucleic acid comprises a discrete, isolated DNA - 
or RNA molecule encoding the complete hK2 polypeptide, which can include 
the pre-pro, pro or mature forms. Most preferably, the nucleic acid is a DNA 
sequence contiguous with SEQ ID NO: 5, 7 or 9, i.e., as shown in Figs. 5, 6 

20 or 7. These DNA sequences can be produced using the polymerase chain 
reaction (PCR), and novel oligonucleotide primers employed in the synthesis 
are also an embodiment of the invention. 

The nucleic acid sequence also can comprise a promoter 
operably linked to the nucleic acid sequence. Therefore, the invention also 

25 comprises a chimeric expression vector comprising the above-described 

nucleic acid sequence, operationally linked to control sequences recognized by 
a host cell transformed with the vector, as well as said trarisforrned host cell, 
and methods of its preparation and use to produce recombinant hK2. Thus, 
the present invention also provides a method of using a nucleic acid molecule, 

30 such as a cDNA clone encoding hK2 polypeptide, comprising expressing the 
nucleic acid molecule in a cultured host cell trareformed, preferably stably 
transformed, with a chimeric expression vector comprising said nucleic acid 
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molecule operably linked to control sequences recognized by the host cell 
transformed with the vector, and recovering the hK2 polypeptide from the 
transgenic host cell, i.e., from the culture medium. As used herein, the term 
"chimeric" means that the vector comprises DNA from at least two different 
5 species, or comprises DNA from the same species, which is linked or 
associated in a manner which does not occur in the "native" form of said 
species. 

More specifically, E. coli and baculovirus insect cells systems 
have been employed to produce hK2 polypeptides in two forms, i.e. pre-pro 

10 hK2 (pphK2) and mature hK2 (mhK2). Thus, the present invention provides 
the first example of the overexpression of hK2 in heterologous systems. 
However, although pphK2 produced in E_£fili has proven to be an invaluable 
resource for generating antibodies to the denatured form of the protein, it is 
desirable to both discern the steps involved in the biosynthesis of hK2 and to 

15 obtain antibodies specific for the fully processed and secreted form of the 
protein. Therefore, mammalian cell systems have been employed to produce 
hK2 polypeptides. Thus, the present invention also provides the first example 
of the expression of hK2 in mammalian cells and purification and 
characterization of the secreted protein. 

20 The high degree of amino acid sequence homology of hK2 with 

hK3 indicates that measuring serum concentrations of both proteins may be 
useful in the diagnosis and monitoring of prostate cancer. For example, the 
antibodies developed against hK3 now used in these assays could theoretically 
also recognize hK2, because of mutual amtarnination in the antigenic 

25 preparations used to develop the anti-hK3 antibodies or because of antibody 
cross-reactivity between these two proteins. This could account for the 
substantial rjercentage of false positive results which are observed in current 
hK3 assays. On the other hand, if circulating hK2 levels are also elevated 
above baseline levels in prostate cancer patients, detection of hK2 by hK2- 

30 specific antibodies could provide an alternative, confirmatory assay for 
prostate cancer. 
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Therefore, hK2 polypeptide, as well as variants and subunits 
thereof, produced by the present method can be used to produce populations 
of antibodies that, in turn, can be used as the basis for assays to detect and 
quantify hK2 polypeptide (or "protein") in samples derived from tissues such 
5 as prostate carcinomas, cells such as prostate cell lines, or from fluids such as 
seminal fluid or blood. Thus, the present invention also provides populations 
of monoclonal or polyclonal antibodies that specifically bind to hK2 
polypeptide, while not significantly binding to hK3. The term "significantly" 
is defined by reference to the comparative assays discussed below. These 

10 antibodies can also be used in affinity chromatography, to purify rriammalian 
hK2 from natural sources. The isolated, substantially homogeneous hK2 can 
also be employed as a component in diagnostic assays for "native" hK2 in 
samples derived from human tissues or physiological fluids. For example, the 
recombinant hK2 can be bound to a detectable label and employed in 

15 competitive immunoassays for hK2, as described in U.S. patent application 
Serial No. 08/096,946, filed July 22, 1993. 

As used herein with respect to the present invention, the terms 
"hK2 polypeptide," "hK2 protein," and "hK2" are considered to refer to 
identical human materials, unless otherwise indicated 

20 

Brief Description of the Figures 
Figure 1 depicts a time course study of recombinant pphK2 in 
s/9 cells infected with recombinant pphK2 virus. At each of the time points 
cells were depleted of methionine and cysteine for 1 hour in deficient media 
25 and then supplemented with [ 35 S]-methione and [ 35 S]-cysteine. Protein was 
determined by Bradford assay. Aliquots of protein (20 ug) were loaded onto 
a 12% Tris-Glycine SDS gel. A Prwsphcrimager cassette was exposed 
overnight. The band of interest is indicated with an arrow, w.t: wild type. 

Figure 2 depicts the detection of recombinant mhK2 in cell 
30 lysate fractions. S/9 cells were infected either with recombinant mhK2, wild 
type or left uninfected for 48 hours. Methionine and cysteine pools were 
depleted for 1 hour in deficient media Cells were supplemented with [ 35 S> 
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methionine and [ 35 S>cysteine for 6 hours. Cells were separated into soluble 
and insoluble fractions using H 2 0 and repeated freeze/thaw conditions. 
Aliquots of protein (50 ug per lane) were loaded onto a 10% Tris-Glycine 
SDS gel and electrophoresed The gel was dried and exposed to x-ray film 
5 for 2 days. The band of interest is indicated with an arrow. 

Figure 3 depicts the expression of recombinant pphK2 in E. 
coli. E. coli strain BL21 (DE3) LysS harboring pBppHK2 was grown in LB 
media to O.D.^ 0.2 and incubated without (lane 2, not-induced (N)) or with 
(lane 3, induced (I)) 0.4 mM IPTG for 2 hrs. Cells were lysed in sample 

10 buffer and subjected to SDS/PAGE on a 4-20% gradient gel. Protein bands 
were visualized by staining the gel with Coomassie blue. 

Figure 4 depicts the amino acid sequences of mature hK2 
(deduced from cDNA sequence, SEQ ID NO: 16) and hK3 (SEQ ID NO: 1). 
Underlined sequences denote nonhomologous regions that can be used for 

15 preparation of antibodies specific to hK2. 

Figure 5 depicts pphK2 cDNA containing a BamHl site at the 
5' end and a Pstl site at the 3' end (SEQ ID NO: 5) (coding strand is 
numbered) as well as the amino acid sequence of pre-pro hK2 encoded 
thereby (SEQ ID NO: 6). The amino acid sequences of pro hK2 and mature 

20 hK2 are also shown on the Figure. 

Figure 6 depicts mhK2 cDNA containing an EcoRl site at the 
5' end and Pstl site at the 3* end (SEQ ID NO: 7), as well as the 
corresponding amino acid sequence (SEQ ID NO. 8) which encompasses the 
amino acid sequence of mhK2 polypeptide. 

25 Figure 7 depicts pro hK2 DNA (SEQ ID NO 9) (coding strand 

is numbered) and the amino acid sequence of pro hK2 (SEQ ID NO: 10). 

Figure 8 depicts a gel ccmfirming the expression of recombinant 
pphK2 in a mammalian ceil line. AV12-pGThK2 (Lane 4-6) and AV12-pGT- 
d (Lane 3) clonal cell lines were grown in D10F media. About 300ul of 

30 spent medium from the above clones were concentrated and subjected to 
SDS/PAGE along with See Blue MW marker (lane 1) and pphK2 lysate from 
E. coli ceils (lane 2). The gel was blotted onto nitrocellulose paper and 
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immunoblotted using a 1/1000 dilution of anti-pphK2 rabbit antiserum. HRP- 
goat anti-rabbit was used as the secondary probe and the blot was developed 
by DAB plus Hfo. Lane 3 (AV12-pGT-d) is AV12 transfected with vector 
without insert 

5 Figure 9 depicts the DEAE chromatography of AV12 media 

The sample was applied in a bicarbonte buffer, pH 8 and eluted with a salt 
gradient The solid line is the A^ elution profile. The triangle line 
represents the ELISA assay of individual samples which had been dried onto 
microtiter plates and developed with rabbit anti-hK2 antibody. 

10 Figure 10 depicts the hydrophobic interaction profile of DEAE 

fractions. The fractions were pooled, concentrated and applied to an HIC 
column in \2 M sodium sulfate, and eluted with a decreasing salt gradient 
The solid line is A^ and the triangle line shows the ELISA assay profile of 
the fractions using rabbit anti-hK2 antibody. 

15 Figure 1 1 depicts the Size Exclusion Chromatography of HIC 

purified prohK2, in particular, the A^ profile of 22 min peak eluted off HIC 
column. The 19.4 min peak appears homogeneous by SDS-PAGE. After this 
peak was lyophilized, the N-terrninal sequence and amino acid composition 
confirmed its identity as the pro form of hK2. 

20 Figure 12 depicts the SDS/PAGE analysis of prohK2 and PSA. 

1.5ug of purified phK2 or PSA was boiled in sample buffer containing (R) or 
not containing (N) 1% BME. Samples were subjected to SDS/PAGE on a 4- 
20% gel. The protein bands were visualized by staining the gel with silver. 

25 Detailed Description of the Invention 

As used herein, die term "hK2 polypeptide" preferably 
encompasses the recombinant pre-pro, pro and mature hK2 polypeptides. As 
proposed herein, a mature hK2 polypeptide having the amino acid sequence 
shown in Fig. 4 (SEQ ID NO: 16), as well as "variant" polypeptides which 

30 share at least 90% homology with SEQ ID NO: 16 in the regions which are 
substantially homologous with hK3, i.e., which regions are not identified by 
bars as shown in Fig. 4. Such hK2 polypeptides also possess antigenic 
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function in common with the mature hK2 molecule of Fig. 4, in that said 
polypeptides are also definable by antibodies which bind specifically thereto, 
but which do not cross-react with hK3 (or hKl). Preferably, said antibodies 
react with antigenic sites or epitopes that are also present on the mature hK2 

5 molecule of Fig. 4. Antibodies useful to define common antigenic function 
are described in detail in Ser. No. 08/096,946, i.e., polyclonal antisera 
prepared in vivo against hK2 submit 41-56. 

"Isolated hK2 nucleic acid" is RNA or DNA containing greater 
than 15, preferably 20 or more, sequential nucleotide bases mat encode a 

10 biologically active hK2 polypeptide or a variant fragment thereof, that is 

— — i^^r^**-., *~ +u A n^t^wiinn cfrwl r»f natiw hYl rml vrvrtfide RNA 

\*AJl ll|JIV41 IV* J I.W UIV Itvil wwuij v**h>«— v. — • - j J r ~ I 

or DNA, or hybridizes to said RNA or DNA and remains stably bound under 
stringent conditions. Thus, the RNA or DNA is isolated in that it is free from 
at least one contaminating nucleic acid with which it is normally associated in 

15 the natural source and is preferably substantially free of any other mammalian 
RNA or DNA. The phrase "free from at least one contaminating source 
nucleic acid with which it is normally associated" includes the case where the 
nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 

20 normally found in the source cell. An example of isolated hK2 nucleic acid is 
RNA or DNA that encodes a biologically active hK2 polypeptide sharing at 
least 90% sequence identity with the hK3-homologous regions of the hK2 
peptide of Fig. 4, as described above. Hie term "isolated, substantially 
homogenous" as used with respect to an hK2 polypeptide is defined in terms 

25 of the methodologies discussed herein below. 

As used herein, the term "recombinant nucleic acid," i.e., 
"recombinant DNA" refers to a nucleic acid, i.e., to DNA that has been 
derived or isolated from any appropriate tissue source, that may be 
subsequently chemically altered in vitro, an later introduced into target host 

30 cells, such as cells derived from animal, plant, insect, yeast, fungal or 

bacterial sources. An example of recombinant DNA "derived" from a source, 
would be a DNA sequence that is identified as a useful fragment encoding 
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hK2, or a fragment or variant thereof, and which is then chemically 
synthesized in essentially pure form. An example of such DNA "isolated" 
from a source would be a useful DNA sequence that is excised or removed 
from said source by chemical means, e.g, by the use of restriction 
5 endonucleases, so that it can be further manipulated, eg., amplified, for use in 
the invention, by the rnethodology of genetic engineering. 

Therefore, "recombinant DNA" includes completely synthetic 
DNA sequences, semi-synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
10 sequences derived from introduced RNA, as well as mixtures thereof. 
Generally, the recombinant DNA sequence is not originally resident in the 
genome of the host target cell which is the recipient of the DNA, or it is 
resident in the genome but is not expressed 

The recombinant DNA sequence, used for transformation 
15 herein, may be circular or linear, double-stranded or single-stranded 
Generally, the DNA sequence is in the form of chimeric DNA, such as 
plasmid DNA, that can also contain coding regions flanked by control 
sequences which promote the expression of the recombinant DNA present in 
the resultant cell line. For example, the recombinant DNA may itself 
20 comprise a promoter that is active in rnammalian cells, or may utilize a 
promoter already present in the genome that is the transformation target. 
Such promoters include the CMV promoter, as well as the SV 40 late 
promoter and retroviral LTRs (long terminal repeat elements). Aside from 
recombinant DNA sequences that serve as transcription units for hK2 or 
25 portions thereof, a portion of the recombinant DNA may be untranscribed, 
serving a regulatory or a structural function. 

"Control sequences" is defined to mean DNA sequences 
necessary for the expression of an operably linked coding sequence in a 
particular host organism. The control sequences that are suitable for 
30 prokaryotic cells, for example, include a promoter, and optionally an operator 
sequence, and a ribosome binding she. Eukaryotic cells are known to utilize 
promoters, polyadenylation signals, and enhancers. 
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"Operably linked" is defined to mean that the nucleic acids are 
placed in a functional relationship with another nucleic acid sequence. For 
example, DNA for a presequence or secretory leader is operably linked to 
DNA for a polypeptide if it is expressed as a preprotein that participates in 
5 the secretion of the polypeptide; a promoter or enhancer is operably linked to 
a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translation. Generally, "operably linked" means that the DNA 
sequences being linked are contiguous and, in the case of a secretory leader, 
10 contiguous and in reading phase. However, enhancers do not have to be 

sites. If such sites do not exist, the synthetic oligonucleotide adaptors or 
linkers are used in accord with conventional practice. 

Aside from recombinant DNA sequences that serve as 

1 5 transcription units for hK2 or portions thereof a portion of the recombinant 
DNA may be untranscribed, serving a regulatory or a structural function. 

The recombinant DNA to be introduced into the cells further 
will generally contain either a selectable marker gene or a reporter gene or 
both to facilitate identification and selection of transformed cells from the 

20 population of cells sought to be transformed Alternatively, the selectable 
marker may be carried on a separate piece of DNA and used in a co- 
transformation procedure. Both selectable markers and reporter genes may be 
flanked with appropriate regulatory sequences to enable expression in the host 
cells. Useful selectable markers are well known in the art and include, for 

25 example, antibiotic and herbicide-resistance genes, such as rj£& hgt, dhfr. bar, 
aroA. dapA and the like. 

Reporter genes are used for identifying potentially transformed 
cells and for evaluating the functionality of regulatory sequences. Reporter 
genes which encode for easily assayable proteins are well known in the art. 

30 In general, a reporter gene is a gene which is not present in or expressed by 
the recipient organism or tissue and which encodes a protein whose expression 
is manifested by some easily detectable property, e.g., enzymatic activity. 
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Preferred genes include the chloramphenicol acetyl transferase gene (cat) from 
Tn9 of EjaolL the beta-glucuronidase gene (gus) of the uidA locus of Lj&li, 
and the luciferase gene from firefly Photinus pyralis- Expression of the 
reporter gene is assayed at a suitable time after the DNA has been introduced 

5 into the recipient cells. 

Other elements functional in the host cells, such as introns, 
enhancers, polyadenylation sequences and the like, may also be a part of the 
recombinant DNA. Such elements may or may not be necessary for the 
function of the DNA, but may provide unproved expression of the DNA by 

10 affecting transcription, stability of the mRNA, or the like. Such elements may 
be included in the DNA as desired to obtain the optimal performance of the 
transforming DNA in the cell. 

The general methods for constructing recombinant DNA which 
can transform target cells are well known to those skilled in the art, and the 

15 same compositions and methods of construction may be utilized to produce 
the DNA useful herein. For example, J. Sambrook et al., Molecular Cloning; 
a T shnratotv Manual. Cold Spring Harbor Laboratory Press (2d ed, 1989), 
provides suitable methods of construction. 

The recombinant DNA can be readily introduced into the target 

20 cells by transfection with an expression vector comprising cDNA encoding 
hK2, for example, by the modified calcium phosphate precipitation procedure 
of C. Chen et al., Mnl Cell. Biol.. 1 2745 (1987). Transfection can also be 
accomplished by lipofectin, using commercially available kits, e.g., provided 
by BRL. 

25 Suitable host cells for the expression of hK2 polypeptide are 

derived from multicellular organisms. Such host cells are capable of complex 
processing and glycosylation activities. In principle, any higher eukaryotic 
cell culture is workable, whether from vertebrate or invertebrate culture. 
Examples of invertebrate cells include plant and insect cells. Numerous 

30 baculoviral strains and variants and corresponding permissive insect host cells 
from hosts such as Spodoptem frugipenfo (caterpillar), Aedes aegypti 
(mosquito), Aedes dbopictus (mosquito), Drosophxla melatogaster (fiuitfly), 
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and Bombyx mori have been identified See, e.g., Luckow et al., 
Biotechnology. £; 47 (1988); Miller et al., in Genetic Engineering. J. \L 
Setlow et al., eds., Vol. 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda 
et al., Nature. 315:592 (1985). A variety of viral strains for transfection are 
5 publicly available, e.g., the L-l variant of Autognpha cdifomica NPV and the 
Bm-5 strain of Bombyx mori NPV, and such viruses may be used, preferably 
for transfection of Spodoptem fivgipenh cells. 

Recovery or isolation of a given fragment of DNA from a 
restriction digest can empoly separation of the digest on polyacrylamide or 

10 agarose gel by electrophoresis, identification of the fragment of interest by 
compurisoTi of its mobility versus that cf irwriver DNA fragn<ents cf knov.Ti 
molecular weight, removal of the gel section containing the desired fragment, 
and separation of the gel from DNA For example, see Lawn et al., Nucleic 
AcjdiEfiL, 2, 6103-61 14 (1981), and Goeddel et al., Nucleic Acids Res,, & 

15 4057(1980). 

"Southern analysis" or "Southern blotting" is a method by 
which the presence of DNA sequences in a restriction endonuclease digest of 
DNA or DNA-containing composition is confirmed by hybridization to a 
known, labeled oligonucleotide or DNA fragment. Southern analysis typically 

20 involves electrophoretic separation of DNA digests on agarose gels, 

denaturation of the DNA after electrophoretic separation, and transfer of the 
DNA to nitrocellulose, nylon, or another suitable membrane support for 
analysis with a radiolabeled, biotinylated, or enzyme-labeled probe as 
described in sections 9.37-9.52 of Sambrook et al., 

25 "Northern analysis" or "Northern blotting** is a method used to 

identify RNA sequences that hybridize to a known probe such as an 
oligonucleotide, DNA fragment, cDNA or fragment thereof, or RNA fragment. 
The probe is labeled with a radioisotope such as 32-P, by biotinylation or with 
an enzyme. The RNA to be analyzed can be usually electrophoretically 

30 separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, 
nylon, or other suitable membrane, and hybridized with the probe, using 



WO 95/30758 



PCT/US95/06I57 



13 

standard techniques well known in the art such as those described in sections 
7.39-7.52 of Sambrook et al., supra 

"Polymerase chain reaction" or "PCR" refers to a procedure or 
technique in which amounts of a preselected piece of nucleic acid, RNA 

5 and/or DNA, are amplified as described in US. Patent No. 4,683,195. 
Generally, sequence information from the ends of the region of interest or 
beyond is employed to design oligonucleotide primers. These primers will be 
identical or similar in sequence to opposite strands of the template to be 
amplified PCR can be used to amplify specific RNA sequences, specific 

10 DNA sequences from total genomic DNA, and cDNA transcribed from total 
cellular RNA, bacteriophage or plasmid sequences, and the like. See 
generally Mullis et al., Cold Spring HaborSymp, Quait. Biol, 51, 263 
(1987); Erlich, ed, PCR Technology, (Stockton Press, NY, 1989). 

"Stringent conditions" are those that (1) employ low ionic 

15 strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 
M sodium citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50°C, or (2) 
employ during hybridization a denaturing agent such as formarnide, for 
example, 50% (vol/vol) formarnide with 0.1% bovine serum albumin/0. 1% 
Ficoll/0.1% polyvmylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 

20 with 750 mM NaCl, 75 mM sodium citrate at 42°C. Another example is use 
of 50% formarnide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM 
sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardfs 
solution, sonicated salmon sperm DNA (50 ugfail), 0.1% SDS, and 10% 
dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC and 0.1% SDS. 

25 When hK2 polypeptide is expressed in a recombinant cell other 

than one of human origin, the hK2 polypeptide is completely free of proteins 
or polypeptides of human origin. However, it is necessary to purify hK2 
polypeptide from recombinant cell proteins or polypeptides to obtain 
preparations that are substantially homogeneous as to hK2 polypeptide. For 

30 example, the culture medium or lysate can be centrifuged to remove 

particulate cell debris. The membrane and soluble protein fractions are then 
separated The hK2 polypeptide may then be purified from the soluble 
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protein fraction and, if necessary, from the membrane fraction of the culture 
lysate. HK2 polypeptide can then be purified from contaminant soluble 
proteins and polypeptides by fractionation on irranunoaffirity or ion-exchange 
columns; ethanol precipitation; reverse phase HPLC; chromatography on silica 

5 or on a cation-exchange resin such as DEAE; crffomatofocusing; SDS-PAGE; 
ammonium sulfate precipitation; gel filtration using, for example, Sephadex 
G-75; or ligand affinity chromatography. 

Once isolated from the resulting transgenic host cells, 
derivatives and variants of the hK2 polypeptide can be readily prepared. For 

10 example, amides of the hK2 polypeptides of the present invention may also be 
~™o-~4 Ki, t^Knimi^ well known in the art for converting a carboxylic acid 

^l^j^^w 1 - 

group or precursor, to an amide. A preferred method for amide formation at 
the C-terminal carboxyl group is to cleave the polypeptide from a solid 
support with an appropriate amine, or to cleave in the presence of an alcohol, 

15 yielding an ester, followed by aminolysis with the desired amine. 

Salts of carboxyl groups of the hK2 polypeptide may be 
prepared in the usual manner by contacting the peptide with one or more 
equivalents of a desired base such as, for example, a metallic hydroxide base, 
e.g., sodium hydroxide; a metal carbonate or bicarbonate base such as, for 

20 example, sodium carbonate or sodium bicarbonate; or an amine base such as, 
for example, triethylamine, triethanolarnine, and the like. 

N-acyl derivatives of an amino group of the present 
polypeptides may be prepared by utilizing an N-acyl protected amino acid for 
the final condensation, or by acyiating a protected or unprotected peptide. O 

25 acy! derivatives may be prepared, for example, by acylation of a free hydroxy 
peptide or peptide resin. Either acylation may be carried out using standard 
acyiating reagents such as acyl halides, anhydrides, acyl imidazoles, and the 
like. Both N- and Oacylation may be carried out together, if desired In 
addition, the internal hK2 amino acid sequence of Fig. 4 can be modified by 

30 substituting one or two conservative amino acid substitutions for the positions 
specified, including substitutions which utilize the D rather than L form. The 
invention is also directed to variant or modified forms of the hK2 polypeptide 
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of Fig. 4. One or more of the residues of this polypeptide can be altered, so 
long as antigenic function is retained. Conservative amino acid substitutions 
are preferred-that is, for example, aspartic-glutamic as acidic amino acids; 
lysine/arginine/riistidine as basic amino acids; leuche/isoleucine, 

5 methionine/valine as hydrophobic amino acids; 

serine/glycine/alanine/uireciune as hydrophilic amino acids. 

Acid addition salts of the polypeptides may be prepared by 
contacting the polypeptide with one or more equivalents of the desired 
inorganic or organic acid, such as, for example, hydrochloric acid Esters of 

10 carboxyl groups of the polypeptides may also be prepared by any of the usual 
methods known in the art 

Once isolated, hK2 polypeptide and its antigenically active 
variants, derivatives and fragments thereof can be used in assays for hK2 in 
samples derived from biological materials suspected of containing hK2 or 

15 anti-hK2 antibodies, as disclosed in detail in Serial No. 08/096,946. For 
example, the hK2 polypeptide can be labelled with a detectable label, such as 
via one or more radiolabeled peptidyl residues, and can be used to compete 
with endogenous hK2 for binding to anti-hK2 antibodies, i.e., as a "capture 
antigen" to bind to anti-hK2 antibodies in a sample of a physiological fluid, 

20 via various competitive immunoassay format for hK2 which uses immobilized 
anti-hK2 antibodies is carried out by: 

(a) providing an amount of anti-hK2 antibodies attached to a solid 
surface; 

(b) rnbcing the sample of physiological fluid to be tested with a 
25 known amount of hK2 polypeptide which comprises a 

detectable label, to produce a mixed sample; 

(c) contacting said antibodies on said solid surface with said mixed 
sample for a sufficient time to allow imrnunological reactions 
to occur between said antibodies and said hK2, and between 

30 said antibodies and said labelled polypeptide; 

(d) separating the solid surface from the mixed sample; 
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(e) detecting or determining the presence or amount of labelled 
polypeptide either bound to the antibodies on the solid surface 
or remaining in the mixed sample; and 
(0 detentiining from the result in step (e) the presence or amount 

5 of said hK2 in said sample. 

In another format which can detect endogenous hK2 in a 
sample by a competitive inhibition immunoassay, a known amount of anti- 
hK2 antibody is added to a sample containing an unknown amount of 
endogenous hK2. The known amount is selected to be less than the amount 

10 required to complex all of the hK2 suspected to be present, e.g., that would be 
present in z ssmple of the same amount of physiologic?.! fluid obtnincd from a 
patient known to be prostate cancer. Next, a known amount of the hK2 
polypeptide of the invention or a subunit thereof; comprising a detectable 
label is added If endogenous hK2 is present in the sample, fewer antibodies 

15 will be available to bind the labelled hK2 polypeptide, and it will remain free 
in solution. If no endogenous hK2 is present, the added labelled polypeptide 
will complex with the added anti-hK2 antibodies to form binary complexes. 
Next, the binary antibody-antigen complexes are precipitated by an anti- 
mammal IgG antibody (sheep, goat, mouse, etc.). The amount of radioactivity 

20 or other label in the precipitate (a ternary complex) is inversely proportional 
to the amount of endogenous hK2 that is present in the sample, e.g., a pellet 
containing reduced amounts of radioactivity is indicative of the presence of 
endogenous hK2. 

Alternatively to the conventional techniques for preparing 

25 polyclonal antibodies or antisera in laboratory and farm animals, monoclonal 
antibodies against hK2 polypeptide can be prepared using known hybridoma 
cell culture techniques. In general, this method involves prepared an 
antibody-producing fused cell line, e.g., of primary spleen cells fused with a 
compatible continuous line of myeloma cells, and growing the fused cells 

30 either in mass culture or in an animal species from which the myeloma cell 
line used was derived or is compatible. Such antibodies offer many 
advantages in comparison to those produced by inoculation of animals, as they 
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are highly specific and sensitive and relatively "pure t1 immunochemically. 
Immunologically active fragments of the present antibodies are also within the 
scope of the present invention, e.g., the f(ab) fragment, as are partially 
humanized monoclonal antibodies. 
5 The invention will be further described by reference to the 

following detailed examples. 

ExamrJeJ* 

Constmction of HK2 expression vectois 

10 (A) Generation of recombinant baculoviruses containing pphK2 and mhK2 
coding sequences 

A cDNA (approximately 820 bp long) encoding the entire 
prepn>hK2 (pphK2) (from nucleotide #40 to #858 relative to the start site of 
the pphK2 transcript), as shown in Fig. 5, was synthesized from RNA of 

1 5 human BPH tissue using reverse-transcription polymerase chain reaction (RT- 
PCR) technology with a pair of hK2 specific oligonucleotide primers 
(5'ACGCGGATCCAGCATCTCX3GA SEQ ID NO: 2 and 

S'ACAGCTCCAGTTTACTAGAGGTAGGGGTGGGAC 3* SEQ ID NO:3). 
This cDNA was generated such that 5* and 3' ends (with respect to pphK2 

20 sense sequence) were bracketed with BamHl and Pst 1 sequences 

respectively. The cDNA was then purified by agarose gel electrophoresis, and 
digested with BamHl and Pst 1 restriction enzymes. The restricted cDNA 
was ligated with the BamHl -Pst 1 digested pVL1393 plasmid vector and 
transformed into the E. coli HB101 strain. E. coli harboring pphK2 

25 cDNA/pVL1393 plasmid vector were selected and verified by restriction 

enzyme mapping and DNA sequencing. Plasmid pphK2 cDNA/pVL1393 was 
mass-produced in E. coli and purified by CsCl gradient ultra-centrifugation. 

cDNA encoding the mature hK2 was synthesized using PCR 
with the aforementioned pphK2 cDNA as the template plus a pair of hK2 

30 oligonucleotides (5ACC£GGATCCAGCATTjIX3GGA( 
SEQ ID NO: 2 and 

5'ACCGGAATTCATGATIXjrGGGAGGCTCGGAG SEQ ID NO: 4). 
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As noted, the 3* end oligonucleotide was the same 3' end oligonucleotide used 
for synthesizing the pphK2 cDNA. However, the 5* end oligonucleotide was 
different from the 5' oligonucleotide used for the pphK2 cDNA, and therefore 
generates a cDNA coding for the mature form of hK2 (mhK2), as shown in 
5 Fig. 6. The mhK2 cDMA was bracketed with EcoRl and Pstl sequences at 
the 5' and 3' ends respectively. The protein produced from the mhK2 cDNA 
will gain an exogenous methionine at its N-terminus. The mhK2/pVL1393 
vector was generated and purified as described for pphK2/pVL1393. The 
DNA sequence analysis for pphK2 and mhK2 in pVL1393 showed that one 
10 nucleotide (#805) has been altered (G to T) in a silent mutation. 

pplixs^/p VLU7J ui nuuv^/p vlj! jyj x-/ixn \x jig; wac 

cotransfected with a linearized Baculogold DNA (0.5 ug Pharmingen, San 
Diego, CA) into 5^9 insect cells according to Pharmingen instructions (S. 
Gruenwold et al., baculovirus expression vector system: Procedures and 

15 Methods Manual, Pharmingen, San Diego, CA (1993)). Four to six days after 
the transfection, S(9 cell spent medium containing viral particles was harvested 
and used to infect fresh Sj9 cells to amplify viral titers. Total RNA was 
isolated for Northern blot analysis of authentic pphK2 or mhK2 transcript 
using hK2 cDNA as a probe. Further proof of pphK2 or mhK2 transcript 

20 expressed in recombinant virus infected S/9 cells was obtained by RT-PCR 
and DNA sequencing. Pure recombinant baculovirus containing pphK2 or 
mhK2 were obtained by secondary or tertiary plaque purification protocol 
according to instructions from Pharmingen (S. Gruenwold et al., cited above). 

25 Sample ^ 

feneration of prokmvotic expression vector 

A 0.8 kb fragment representing the entire preprohK2 (pphK2) 
coding sequence was generated by polymerase chain reaction (PCR) using 
primers A (5TATACATATGTGGGACCTGGTTCTCTCC3' SEQ ID NO.: 11) 

30 and B (5 , ATATCKJATCCTCAGGGGTTCKJCTGCGATGGT3 , SEQ ID NO: 
12) and plasmid pVL1393 containing pphK2 as the template. The pphK2 
bacterial expression vector (pBPPHK2) was prepared by standard DNA 
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cloning technology, (Sambrook, cited above), to subclone this 0.8 kb fragment 
into the Ndel/BamHl site of the plasmid pPHS579 (a gift from Dr. H. 
Hsiung, Eli Lilly Co, Inc.) under the control of T7 promoter. The DNA of 
the entire insert plus the cloning sites was sequenced to confirm that no 
5 cloning artifacts had occurred and to ensure that no anomalies in the sequence 
had been generated by PCR pBPPHK2 was transformed into E. coli BL21 
(DE3)Lys S (Novagen, Inc., Madison, WI). 

Example 3. 

10 Generation of a mammalian expression vector 

To express hK2 in mammalian cell lines, a 0.8 kb fragment 
representing the entire preprohk2 (pphK2) coding sequence was generated by 
PCR using primers 
A^ATATXXiATCCATATC^ 

15 (SEQ ID NO: 17) and 

B(5'ATATX}GATCCTCAGGGGTTOGCTGCGATGGT3 r ) (SEQ ID NO: 12) 
and plasmid pVL1393 containing pphK2 as the template. The mammalian 
expression vector (pGThK2) was prepared using standard DNA cloning 
technology (Sambrook, 1989), to clone this 0.8 kb fragment into the Bel 1 site 

20 of the plasmid pGT-d (a gift from Dr. Brian Grinnell, Eli Lilly, Inc.) under 
control of the GBMT promoter. The DNA of the entire insert plus the 
cloning sites was sequenced to confirm that no cloning artifacts had occurred 
and to ensure that no anomalies in the sequence had been generated by PCR. 
AV12-664 (ATCC CRL-9595), a cell line derived from a adenovirus-induced 

25 tumors in Syrian hamster, was grown in Dulbecco's modified Eagle's medium 
supplemented with 10% fetal bovine serum (D10F) and transfected with 
plasmid pGThK2 using the calcium phosphate method 

Example 4. 

30 Identification of recombinant pphK2 and mhK2 
A. baculQYirus - insect cell system 
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SJ9 cells (7xl0*/jplate) were seeded onto 100 mm Coming 
plates with 10% fetal calf serum - Graces medium at room temperature for 1 
hr. After attachment on culture plates, cells were infected with wild type or 
recombinant baculovirus in serum free Exceil-400 medium and incubated at 

5 27°C. Control cells were grown in the absence of virus. At designated times 
(24-96 hr) cells were placed in fresh S/9-000 media deficient of either 
methionine or methionine and cysteine for 45-60 min at 27°C, then incubated 
with Promix (0.143 mCi/plate; a mixture of [ 35 S]-methionine and [ 35 S]- 
cysteine; 1,000-1,400 Ci/mmol; Amersham) in serum free and 

10 methionine/cysteine deficient &9IIOO medium (Biofluids) for 5-8 hr or 20 hr. 
After the end of each Incubation time, cells and spent media were separated 
by centrifugation (1,000 rpm; Beckman J-6B; Beckman, Fullerton, CA). Cells 
were washed and centriruged (13,000 rpm; Biofuge 13, Baxter) twice. The 
washed cells were lysed by freeze/thaw in a detergent buffer (10 mM Tris, pH 

15 7.5; 130 mM NaCl, 1% Triton X-100, 10 mM NaF; 10 mM NaPi, lOmM 
Nappi, pH7.5) or H 2 0 and centrifuged to obtain cytosol and insoluble cellular 
fractions. Protein contents of the above samples were determined by either 
the Bradford or Lowry method (BioRad, Inc., Melville, N.Y.). The above 
spent media, cystosol and insoluble cellular fraction were frozen and stored 

20 separately until used A duplicate set of samples were prepared without 35 S- 
labeling. 

For SDS-polyacrylamide gel electrophoresis (PAGE) analysis of 
expression of hK2 protein in S/9 cells, samples were added to sample buffer 
(U.K. Laemmli, MaluK, 221 680 (1970)), heated at 95'C for 5 minutes and 

25 subjected to SDS-PAGE under reducing conditions. 

Northern blot analysis was routinely used to screen and isolate 
clonal recombinant baculoviruses expressing pphK2 or mhK2 mRNA A 
comparison of the corresponding lanes in both autoradiographs of the 
Northern blot and photographs of ethidium bromide staining of RNA shows 

30 that mRNA for pphK2 or mhK2 was present in recombinant virus infected S/9 
but not in wild type virus-infected cells. Moreover, each of the pphK2 or 
mhK2 mRNA positive lanes represents RNA isolated from Sfi cells infected 
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with recombinant viruses derived from a single viral plaque. Thus, the results 
suggest that high frequency (100%) of recombinant baculovirus containing 
either pphK2 or mhK2 was obtained from the above cotransfection. 
Furthermore, the sequences of pphK2 or mhK2 expressed in viral infected Sfi 
5 cells were confirmed by a combination of RT-PCR, cloning and DNA 
sequencing. 

To determine whether the pphK2 protein is expressed in the 
insect cell Sfi, time course studies using 35 S-labeling of de novo synthesis of 
protein was performed and detected by SDS denaturing polyacrylamide gel 

10 electrophoresis (PAGE). As seen in the autoradiograph (Fig. 1), a unique 
protein (about 28 KDa) was found in pphK2-recombinant virus-infected Sfi 
cells at 35-74 hour post-infection. This band was missing in uninfected cells 
or cells infected with wild type virus. The viral polyhedron protein (about 32 
KDa) was found (Fig. 1) as expected in Sfi cells infected with wild type 

15 virus, whereas it was not expressed by recombinant virus (Fig. 1). The 
protein was detected in cytosol when subcellular fractions (cytosol vs. 
insoluble fraction) was prepared by lysing cells with H 2 0 and freeze-thaw, 
whereas this 28 KD protein was detected in insoluble fraction when prepared 
by a detergent buffer and freeze-thaw (data not shown). 

20 The mhK2 protein was also expressed in the insect cell §9, 35 S- 

labeling of de novo synthesized protein was performed As seen in the 
autoradiograph (Fig. 2), a unique protein (about 28 KDa) was found in the 
insoluble fraction of mhK2-recombinant virus-infected ^9 cells at 48 hours 
post-infection. This band was missing in uninfected cells or cells infected 

25 with wild type virus. The viral polyhedron protein (about 32 KDa) was found 
in wild type virus-infected cells, whereas it was not expressed in cells infected 
with recombinant virus (Fig. 1). When the cytosol fraction was examined, no 
28 KDa band was observed 

30 p F Cnli system 

Plasmid pBPPHK2 was transformed into E^CQli BL21 (DE3) 
pLysS (Novagen, Inc., Madison, WI). This strain contains a chromosomal 
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copy of T7 RNA polymerase under the control of inducible LacUVS 
promoter. Upon addition of IPTG (isopropyi-^I>thiogalactopyranoside) the 
expression of the T7 RNA polymerase is induced which in turn activates the 
17 promoter resulting in overproduction of the gene product under control of 
5 this promoter. To determine whether the product of the ppHK2 gene would 
be expressed from pBPPHK2, single colonies of BL21 E. Coli transformed 
with pBPPHK2 were grown to O.D^ = 0.2 in 10 ml LB media plus 
ampicillin (lOOugfail) and induced with 0.4 mM IPTG (Sigma, Inc.). Cells 
were harvested 2 hours after induction by centrifugation and resuspended in 
10 1.5 ml SDS/PAGE sample buffer (U.K. Laernmli, Nanus, 221 680 1970) 

i r cnc/nArx Mn Kr^:„ TUa fko TDrm,l*%A*tr**A ^iltim> 

OL/J/i I-k\Ji-t fUUUJ&K>. IIIV WWII fJVllW* UVIII **»w *A * W u*w*mu 

was resuspended in 0.05M Tris, pH 8.0 (at 9ml/gm cell pellet) and stirred at 
room temperature (25°Q r.t) for 1 hour. Lysozyme (4 mg/ml) was added to 
this suspension (at 1 ml/gm cell pellet) and the suspension was stirred at r.t. 

15 for 30 min followed by incubation on ice for 30 min. The suspension was 
sonicated for 2 min at 150 watts and centrifuged at 3000xg to isolate the 
inclusion bodies. Inclusion bodies were resuspended in running buffer (25 
mM Tris, 192 mM glycine, 0.1% SDS) and after centrifugation both the pellet 
and the supernatant were analyzed by SDS/PAGE. 

20 About 90% of the pphK2 was found to be in the supernatant 

fraction which indicated mat pphK2 is soluble in 0.1% SDS. To prepare 
samples for amino acid sequence analysis, 20ul of inclusion body lysate was 
subjected to SDS/PAGE on a 4-20% gradient gel (BIORAD, Inc., Melville, 
N.Y.). The protein was blotted from the gel onto 0.2u PVDF paper (BIO- 

25 RAD) and stained with Coomassie blue. The protein band of interest was cut 
out from the blot and subjected to amino acid sequencing using a protein 
sequencer model 477A (Applied Biosystem, Inc., Foster City, CA). 

The induced cells overproduced large amounts of a polypeptide 
with apparent molecular mass of about 28kd (Figure 3). Densitometric 

30 analysis indicated that this protein comprised approximately 40% of total 
cellular protein. The size of this protein as determined by an SDS-PAGE gel 
was comparable to that predicted from coding sequence for pphK2. To 
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confirm that this protein is pphK2, the sequence of the first 10 amino acids 
(MWDLVLSIAL) (SEQ ID NO: 13) from the N-terminus was determined. 
This sequence agrees perfectly with that deduced from the DNA sequence of 
pphK2 cDNA. As noted, it has different identity from the first 10 amino 
5 acids of both pphKl (MWrXVLCLAL) (SEQ ID NO: 14)andpphK3 
(MWVPWFLTL) (SEQ ID NO: 15). It also shows that this protein is not 
modified or processed at the N-terminus either during or after expression in E. 
coli. These results demonstrate that we were able to accurately express 
pphK2 in E. coli from pBPPHKZ 

10 

c Mammalian System 

1. Isolation and Purification of protein 

Plasmid pGThK2 was transformed into hamster cell line AVI 2- 
664 (ATCC-CRL-9595). To determine whether the product of the ppHK2 

15 gene would be expressed from pGThK2, AV12-pGThK2 #2 was grown in 
D10F + 200nM MTX. At about 60% confluency the cells were washed with 
Hank's balanced salt solution and resuspended in serum-free HH4 medium 
The spent medium was collected after 7 days (serum-free spent medium) and 
stored at -20°C. Figure 8 depicts a SDS-PAGE confirming expression of 

20 recombinant pphK2 in a mammalian cell line. AV12-pGThK2 (Lane 4-6) and 
AV12-pGT-d (Lane 3) clonal cell lines were grown in D10F media About 
300ul of spent medium from the above clones were concentrated and 
subjected to SDS/PAGE along with See Blue MW marker (lane 1) and pphK2 
lysate from E. coli cells (lane 2), The gel was blotted onto nitrocellulose 

25 paper and immunoblotted using a 1/1000 dilution of anti-pphK2 rabbit 

antiserum HRP-goat anti-rabbit was used as the secondary probe and the blot 
was developed by DAB plus H^. Lane 3 (AV12-pGT-d) is AV12 
transfected with vector without insert. 

To purify the protein, the serum-free spent medium was 

30 concentrated from 5-10 fold by ultrafiltration with a 10 kDa molecular weight 
cutoff membrane then dialyzed overnight at 4°C versus 50 mM sodium 
bicarbonate, pH 8, Samples were filtered with 0.2 u filters and then pumped 
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directly onto a TSK DEAE-5PW HPLC column (21 mm X 150 mm) at a flow 
rate of 5 mL/min. Buffer A contained 50 mM sodium bicarbonate, pH 7.9 
Buffer B contained 50 mM sodium bicarbonate plus 0.5 M sodium chloride, 
pH 7.6. The elution profile shown in Figure 9 was developed with a gradient 

5 from 0-50% Buffer B over 35 min; 50-100% B from 35-40 min and isocratic 
elution at 100% B for 5 min before re-equilibration in Buffer A. The flow 
rate was 5mL7min throughout 

DEAE fractions were assayed for the presence of hK2 by 
ELISA using rabbit arut-r#hK2 as primary antibodies. The EUSA assayed 

10 showed a peak of hK2 activity which eluted at approximately 0-2M NaCl 

/.i *u~ *— : — ir;™ Q^ \i/V»i^k /rtrwpl atpH impII with fhp 

^miowii as uit; mangle iuiw ui x igwuw //, "»"~-« — — — 

appearance of a 34 kDa band of protein seen by SDS-PAGE in the same 
fractions (data not shown). 

Fractions with hK2 activity were pooled and concentrated by 

15 ultrafiltration with 10 kDa membranes to approximately 5-8 mL where upon 
solid ammonium sulfate was added to make a final concentration of 1 M 
This sample was then injected onto a PolyLC. polypropyl aspartamide column, 
1000 A pore size, 4.6 mm X 200 mm, to resolve protein by hydrophobic 
interaction chromatography (HIC, see Figure 10). Buffer A was 20 mM Na 

20 phosphate, 1.2 MNa sulfate pH 6.3 and Buffer B was 50 mMNa phosphate, 
5% 2-propanol, pH 7.4. The elution gradient was 0-20% B over 5 min; 20- 
55% B from 5-20 min, isocratic at 55% B from 20-23 min, 55-100% B from 
23-25 min; isocratic at 100% B for 2 min before re-equilibration Buffer A 
The flow rate was 0.7 mUmin. Greater than 90% of the A^ was not 

25 retained on FflC column. The main peak retained on HIC, which eluted at 22 
min, also showed the highest peak of activity by EUSA assay (triangle line, 
Figure 10). 

HIC fractions which tested positive for hK2 on ELISA were 
pooled, ultrafilter concentrated as above to a volume less than 1 mL then 
30 injected on a 10/30 Pharmacia S12 size exclusion column equilibrated in 100 
mM ammonium acetate. The flow rate was 0.7 mL/mia When the 22 min 
peak from HIC was resolved by size exclusion chromatography, typically 
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about 80-90% of the protein eluted at 19.4 min, a retention time 
consistent with a protein of approximately 34 kDa (Figure 1 1). The only 
other protein peak on SEC, during at 16.7 min, corresponded to an about 
70KDa protein seen also in previous purification steps. 
5 To examine the efficiency of our purification scheme, 1 .5 ug of 

purified phK2 was subjected to SDS/PAGE in the presence or absence of JJ- 
mercaptoethanol (BME), and the gel was stained with silver. Results showed 
that the phK2 in our sample was about 95% pure (Fig. 12). It also showed 
that pro-hK2 migrated at about 30 KD in the absence of BME, and it 

10 migrated at about 34 kDa in the presence of BME. This pattern is similar to 
that observed for the PSA purified from seminal fluid (Fig. 12). 

Recombinant phK2 is recognized by rabbit anti-pphKZ rabbit 
anti-PSA and a murine monoclonal antibody directed against a polypeptide 
covering amino acids 41-56 of hK2, when analyzed on WESTERN blots or 

15 when dried down on microtiter plates. However, phK2 was not detectable by 
these antibodies in sandwich assays. These results further demonstrate that 
the phK2 and PSA are corrformationally different and the antibodies currently 
available to PSA or hK2 can not detect phK2 in its native form. Furthermore, 
phK2 was not detectable by the Tandem R or free-PSA assays (immunological 

20 assays for detection of PSA in serum). 

A sample of the hybridoma (HK1A 523.5) secreting the murine 
monoclonal antibody has been deposited in the American Type Culture 
Collection, Rockville, MD, and assigned ATCC HB-11876. 

25 2. Amino Acid Analysis and Protein Sequencing of phK2 

The peak collected off size exclusion chromatography (SEC) in 
ammonium acetate was lyophilized to remove the buffer then reconstituted in 
water. An aliquot (2.5ug)of this sample was loaded on a Porton membrane 
(Beckman instruments) and subjected to automated N-terminal sequence 
30 analysis on an Applied Biosystems model 477A protein sequencer which 
yielded the following sequence: 
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Val-Pro-I^Ile<Jln-Ser-Arg-Ile-Val<jIy^ly-Trp<]lu- (SEQ ID NO: 18). An 
aliquot of the same sample in water was also hydrolyzed in gaseous 6 N HCI 
under vacuum for 20 h at 112 °C then reconstituted in 0.1N HCI and analyzed 
on an Hewlett Packard Aminoquant amino acid analyzer utilizing pre-column 
5 derivatization of amino acids with OPA for primary and FMOC for secondary 
amines. 

No competing sequence was evident from the profile of amino 
acids released sequentially by the Edman degradation procedure. By analogy 
to PSA this protein is pro hK2, since the known sequence of mature PSA has 
10 been shown to begin with Ileu-Val-Gly-etc and pro PSA has been postulated 
to have an extra 7 amino acids at the N-terminus. Amino acid analysis of this 
protein yielded an amino acid composition consistent with the recombinant 
sequence of prohK2. These results demonstrate that pphK2 was accurately 
expressed in the mammalian cell line AVI 2-664 from pGThK2. 

15 

ErampleS. 

Production of antibodies to recombinant pphKZ 

A. £ Coli System 

To prepare pphK2 for rabbit immunization, the inclusion bodies 

20 obtained from bacterial cultures of BL21 (pBpphK2) after IPTG induction as 
in Example 4B were resuspended in lOOul SDS/PAGE sample buffer/ml 
bacterial culture and electrophoresed on preparative SDS/PAGE. The pphK2 
band was excised and electroeluted from the gel into running buffer (25mM 
Tris, 192 mM glycine, 0.1% SDS) and used as the immunogea Two rabbits 

25 were each immunized with lOOug of the immunogen in complete FreuncTs 
adjuvant and were boosted twice in three week intervals with lOOug of the 
immunogen in incomplete Freund's adjuvant and PBS, respectively. Rabbit 
anti-pphK2 sera was obtained one week following the second boost The 
presence of anti-pphK2 in the rabbit antiserum was shown by ELISA (data not 

30 shown). Once confirmed by this method, the highest titer antiserum was 
tested on Western blots using lysates from IPTG induced or non-induced 
cultures of BI.21 (pBpphK2). It was evident that the antiserum contained 
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antibodies highly specific for the pphK2 protein since a protein band at about 
28kd corresponding to pphK2 was recognized only in the induced lysate. The 
antiserum also recognized the purified pphK2 further showing the specificity 
of the antibodies to pphK2. The above data demonstrate that the antibodies 
5 recognize the prepro-form of hK2. 

To delineate if the antiserum recognizes the mature form of 
hK2 (mhK2X mhK2 was expressed in E. coli as a glutathione S-transferase 
fusion protein (GST-rnhK2, 58kd), and the cell lysate was immunoblotted 
using anit-pphK2 rabbit antiserum It was evident that anti-pphK2 antiserum 

10 recognized the GST-mkK2, demonstrating that antibodies were at least in part 
against the mature region of pphK2. To examine the pattern recognized in 
seminal fluid by anti-pphK2 antibodies, seminal fluid was prepared from 
pooled semen as described by Sensabaugh and Blake, J. Urology. 142, 1523 
(1990), and immunoblotted with anti-pphK2 rabbit antiserum. The antiserum 

15 recognized a major band at about 34kd plus several minor bands at lower 
MW. The pre-immune serum did not recognize any bands in any of the 
above experiments, showing that the antibodies were generated by 
immunization. 

To determine whether there are any pphK2-specific antibodies 
20 in rabbit anit-pphK2 antiserum, the antibodies cross-reacting to PSA were 
absorbed out of the antiserum by a PSA affinity resin. Specifically, 1ml of 
the sera was diluted with lmL 100 mM HEPES, pR 7.5 and incubated with 
native PSA-bound Affigel-10 for 3.5 hours at 4°G The mixture was used 
poured into a column, the flow-through was collected and the column was 
25 washed with 30 ml HEPES buffer. Antibodies bound to the column (eluate) 
were eluted by acetic acid (IN, pH 4.0) and neutralized to pR 6.6 with 
NH*OR Native PSA was isolated from seminal fluid as described by 
Sensabaugh and Blake, cited above. ppPSA was purified from E_coJi 
transformed with plasmid pPHS579 (containing ppPSA under control of T7 
30 promoter) using a procedure analogous to pphK2 purification. 

The flow-through and the column eluate were tested for Abs 
recognizing pphK2, ppPSA and native PSA (PSA isolated from seminal fluid) 
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using Western blot analysis. It was evident that antibodies in the untreated 
rabbit anti-pphK2 antiserum recognized all three proteins indicating that 
pphK2, ppPSA and seminal fluid-PSA share some similar epitopes. However, 
while the column eluate contained antibodies that recognized all three protein, 
5 the flow-through contained antibodies that recognized only pphK2. This 
indicates that anti-pphK2 antiserum contains pphK2-specific antibodies and 
these antibodies can be isolated by PSA affinity absorption. This system 
enabled us to generate anti-pphK2 antibodies which recognize both pphK2 and 
mhK2. Thus, utilizing immunogenic and pure recombinant hK2 protein, 

10 generate rabbit antiserum was generated which contains pphK2-specific 
antibodies, providing a Valuable source for generating and screening for 
specific monoclonal antibodies. 

These examples describe the use of three heterologous 
expression systems (i.e. both prokaryotic and eukaryotic) for the successful 

15 expression of the hK2 polypeptide. Thus, the method of the invention enables 
production of large quantities of substantially pure hK2 polypeptide. The 
polypeptide can be used both to study its biological functions and to produce 
immunodetection reagents such as labelled hK2 polypeptide, labelled 
fragments thereof and antibodies thereto. The immunoreagents can provide a 

20 method to purify native hK2 and to study the properties of the purified native 
hK2 polypeptide. 

The pphK2 overproduced in E coli can be readily solubilized 
in 0.1% SDS, thus solubility is not a problem. This is in contrast to the 
expression of human salivary kallikrein protein, hKl, in E. coli, which was 

25 found in insoluble inclusion bodies (J. Wang, et al Biochem I» 226, 63 
(1991)). In contrast, the present invention yields almost pure protein which 
can be purified to homogeneity by preparative SDS-PAGR This purified 
recombinant pphK2 can be used for the generation of monoclonal and 
polyclonal antibodies. 

30 As shown above, Baculogold viral DNA can be used to 

generate a recombinant baculo virus containing pphK2 or mhK2. Use of 
Baculogold viral DNA provides high selection of positive recombinant 
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baculoviruses. Indeed, Northern blot analysis showed a high frequency of 
recombinant virus expressing pphK2 or mhK2 mRNA. Moreover, SDS-PAGE 
analysis showed that both pphK2 and mhK2 recombinant viruses produced 
unique proteins with sizes similar to the calculated molecular weights for 
5 pphK2 or mhK2. Although the levels of the recombinant hK2 expressed in 
insect system may not be as high as in E. coli, the hK2 protein produced in 
baculovirus-insect system may contain the secreted form which would be 
more like the natural form of the protein 

Plasmids pphK2/pVL1393 in E col. H13101 has been 

10 deposited in the American Type Culture Collection, Rockviile, MD, USA on 
May 2, 1994 under the provisions of the Budapest Treaty and have been 
assigned accession number ATCC 69614. 

The invention has been described with reference to various 
specific and preferred embodiments and techniques. However, it should be 

15 understood that many variations and modifications may be made while 
remaining within the spirit and scope of the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT; Mayo Foundation for Medical Education 
and Research 
Hybritech Incorporated 
Tiiidall, Donald J. 
Young, Oiarles Y.F. 
Saedi, Mohairmed S. 

(ii) TITLE OF INVENTION: Recombinant HK2 Polypeptide 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Schwegman, Lundberg & Woessner, P. A. 

(B) STREET: 3500 IDS Center 

(C) CITY: Minneapolis 

(D) STATE: MN 

(E) CCnNTTRY: USA 

(F) ZIP: 55402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) OCMKJTER: IEM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Raasch, Kevin W. 

(B) REGISTRATION NUMBER: 35,561 

(C) REFERENCE/DOCKET NUMBER: 150.148WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 612-339-0331 

(B) TELEFAX: 612-339-3061 
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(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC7TH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

<xi) SEQUENCE DESCRIPTION: SBQ ID NO:l: 

He Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
15 10 15 

Leu Val Ala Ser Arg Gly Arg Ala Val Cys Gly Gly Val Leu Val His 
20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys He Arg Asn Lys Ser Val 
35 40 45 

He Leu Leu Gly Arg His Ser Leu Phe His Pro Glu Asp Thr Gly Gin 
50 55 60 

Val Phe Gin Val Ser Thr Ser Phe Pro His Pro Leu Tyr Asp Met Ser 
65 70 75 80 

Leu Leu Lys Asn Arg Phe Leu Arg Pro Gly Asp Asp Ser Ser His Asp 
85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Glu Leu Thr Asp Ala Val 
100 105 110 

Lys Val Met Asp Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Thr Pro 
130 135 140 

Lys Lys Leu Gin Cys Val Gin Leu His Val He Ser Asn Asp Val Cys 
145 150 155 160 

Ala Gin Val His Pro Gin Lys Val Thr Lys Phe Met Leu Cys Ala Gly 
165 170 175 

Arg Trp Thr Gly Gly Lys Ser Thr Cys Ser Gly Asp Ser Gl£ Gly Pro 

Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Ser Glu 
195 200 205 

Pro Cys Ala Leu Pro Glu Arg Pro Ser Leu Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arg Lys Trp He Lys Asp Thr He Val Ala Asn Pro 
225 230 235 
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(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
ACGCGGATCC AGCATGTGGG ACCTG7ITCT CT 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO:3 
ACAGCTGCAG TTTACTAGAG GTAGGQoTGG GAC 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC7TH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
AOCGGAATTC ATGATTGTGG GAGGCTGGGA GTGT 
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(2) INFORMATION FOR SBQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDETNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATICN: 10.. 792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GGATCCAGC ATS TGG GAC CTG GTT CTC TCC ATC QCC TTG TCT GTG GGG 48 
Met Tip Asp Leu Val Leu Ser He Ala Leu Ser Val Gly 
1 5 10 

TGC ACT GGT GCC GTG CCC CTC ATC CAG TCT COG ATT GIG GGA GGC TGG 96 
Cys Thr Gly Ala Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp 
15 20 25 

GAG TGT GAG AAG CAT TCC CAA CCC TOG CSG GIG GCT GTG TAC ACT CAT 144 
Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His 
30 35 40 45 

GGA TGG GCA CAC TGT GGG GGT GTC CTG GTS CM2 CCC CAG TOG GIG CTC 192 
Gly Trp Ala His Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu 
50 55 60 

ACA GCT GCC CAT TGC CTA AAG AAG AAT AGC CAG GTC TGG CIG GGT OQG 240 
Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg 
65 70 75 

CAC AAC CTG TTT GAG OCT GAA GAC AC3V GGC C7\G AGG GTC OCT GTC AGC 288 
His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser 
80 85 90 

CAC AGC TTC CCA CAC COS CTC TAC AAT ATS AGC CTT CTG AAG CAT CAA 336 
His Ser Phe Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin 
95 100 105 

AGC CTT AGA CCA GAT GAA GAC TCC AGC CAT GAC CTC ATC CIG CTC 03C 384 
Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg 
110 115 120 125 

CTG TCA GAG CCT GCC AAG ATC ACA GAT GIT GIG AAG GTC CIG GGC CIG 432 
Leu Ser Glu Pro Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu 
130 135 140 

CCC ACC CAG GAG CCA GCA CTG GGG ACC ACC TGC TAC GCC TCA GGC TOG 480 
Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp 
145 150 155 

GGC AGC ATC GAA OCA GAG GAG TTC TTG OGC CCC AGG ACT CTT CAG TGT 528 
Gly Ser He Glu Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys 
160 165 170 
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GTC AGC CTC CAT CTC CTG TCC AAT GAC ATS TGFT GCT AGA GCT TAC TCT 576 
Val Ser Leu His Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser 
175 180 185 

GfiG AAG GIG ACA GAG TTC ATG TIG TUT GCT GOG CTC TOG ACA GGT GGT 624 
Glu Lys Val Thr Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly 
190 195 200 205 

AAA GAC ACT TGT G3G GGT GAT TCT GOG GGT CCA CTT GTC TGT AAT OCT 672 
Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly Pro L^u Val Cys Asn Gly 
210 215 220 

GTTG CTT CAA GGT ATC ACA TCA TGG GGC CCT GAG CCA TGT GCC CTG CCT 720 
Val Leu Gin Gly He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro 
225 230 235 

GAA AAG CCT GCT GIG TAC ACCAAGGIGGIGCATTACGGGAAGT3G ATC 768 
Glu Lys Pro Ala Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp He 
240 245 250 

AAG TAC ACC ATC GCA GCC AAC CCC T3AGTGCCCC TGTCCCACCC CTACCTCTAG 822 
Lys Tyr Thr He Ala Ala Asn Pro 
255 260 

TAAACT3CAG 



(2) INFORMATION FOR SBQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 261 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Trp Asp Leu Val Leu Ser He Ala Leu Ser Val Gly Cys Thr Gly 
1 5 10 15 

' Ala Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu 
20 25 30 

Lys His Ser Glu Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala 
35 40 45 

His Cys Gly Gly Val Leu Val Hie Pro Gin Trp Val Leu Thr Ala Ala 
50 55 60 

His Cvs Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn I^u 
65 70 75 80 

Phe Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe 
85 90 95 

Pro His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg 
100 105 110 



WO 95/30758 



PCT/US95/06157 



35 



Pro Asp Glu Asp 
115 



Pro Ala 
130 

Glu Pro 
145 

Glu Pro 

His Leu 

Thr Glu 

Cys Gly 
210 

Gly He 
225 

Ala Val 
He Ala 



Lys He 

Ala Leu 

Glu Glu 

Leu Ser 
160 

Phe Met 
195 

Gly Asp 

Thr Ser 

Tyr Thr 

Ala Asn 
260 



Ser Ser His 

Thr Asp Val 
135 

Gly Thr Thr 
150 

Phe Leu Arg 
165 

Asn Asp Met 
Leu Cys Ala 



Ser Gly Gl 
213 

Tip Gly Pro 
230 

Lys Val Val 
245 

Pro 



Leu 

12 

Val Lys 

Cys Tyr 

Pro Arg 

Cys Ala 
185 

Gly Leu 
200 

Pro Leu 
Glu Pro 
His Tyr 



Met Leu Leu Arg 
125 

Val Leu Gly Leu 
140 

Ala Ser Gly Trp 
155 

Ser Leu Gin Cys 
170 

Arg Ala Tyr Ser 



Trp Thr Gly Gly 
205 

Val Cys Asn Gly 
220 

Cys Ala Leu Pro 
235 

Arg Lys Trp He 
250 



Leu Ser Glu 

Pro Thr Gin 

Gly Ser He 
160 

Val Ser Leu 
175 

Glu Lys Val 
190 

Lys Asp Thr 
Val Leu Gin 



Glu Lys Pro 
240 

Lys T^r Thr 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



Ox) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 720 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

GAATTC ATG AIT GIG GGA GGC TOG GAG TGT GAG AAG CAT TCC CAA CCC 48 
Met He Val Gly Gly^ Trp Glu Cys Glu Lys His Ser Gin Pro 

TOG CAG GIG GCT GTG TAG ACT CAT GGA TOG GCA CAC TGT GGG GGT GTC 96 
Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His Cys Gly Gly Val 
ll 20 25 30 

CIG GIG CAC CCC CAG TOG GIG CTC ACA GCT GCC CAT TCC CIA AAG AAG 144 
Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys 
35 40 45 
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AAT AGC CAG GTC TOG CIG GGT COG CAC AAC CIG TIT GAG OCT GAA GAC 192 
Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp 
50 55 60 

ACA GGC CAG AGG GTC OCT GTC AGC CAC AGC TTC CCA CAC COG CTC TAC 240 
Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr 
65 70 75 

AAT ATG AGC CTT CIG AAG CAT CAA AGC CTT AGA OCA CKT GAA GAC TCC 2B8 
Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser 
80 85 90 

AGC CAT GAC CTC ATG CTG CTC OGC CTG TCA GAG OCT GOC AAG ATC ACA 336 
Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys lie Thr 
95 100 105 110 

GAT GTT GIG AAG GTC CTG GGC CTG CCC ACC CAG GAG OCA GCA CIG GGG 384 
Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly 
115 120 125 

ACC ACC TGC TAC GCC TCA GGC TGG GGC AGC ATC GAA CCA GAG GAG TTC 432 
Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu Phe 
130 135 140 

TTG OGC CCC AGG ACT CTT CAG TGT GIG AGC CTC CAT CTC CTG TCC AAT 480 
Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn 
145 150 155 

GAC ATG TGT GCT AGA GCT TAC TCT GAG AAG GIG ACA GAG TTC ATG TTG 528 
Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu 
160 165 170 

TUT GCT GGG CTC TOG ACA GGT GGT AAA GAC ACT TUT GGG GGT GAT TCT 576 
Cys Ala Gly lieu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser 
175 180 185 190 

GGG GGT CCA CTT GTC TUT AAT GGT GIG CTT CAA GGT ATC ACA TCA TGG 624 
Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp 
195 200 205 

GGC CCT GAG CCA TGT GCC CTG CCT GAA AAG CCT GCT GIG TAC ACC AAG 672 
Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys 
210 215 220 

GIG GIG CAT TAC OGG AAG TGG ATC AAG TAC AOC ATC GCA GOC AAC CCC 720 
Val Val His Tyr Arg Lys Trp He Lys Tyr Thr He Ala Ala Asn Pro 
225 230 235 

TGAGIGCCCC TGTCCCACCC CTACCTCIAG TAAACTGCAG 760 
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(2) INFORMATION FOR SEQ ID MO: 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met lie Val Gly Gig Tip Glu Cys Glu Lys His Ser Gin Pro Trp Gin 

Val Ala Val Tyr Ser His Gly Itp Ala His Cys Gly Gly Val L*u Val 
20 25 

His Pro Gin Trp Val Leu Tnr Ala Ala His Cys Leu Lys Lys Asn Ser 
35 40 " 

Gin val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly 
50 55 
• Gin Arg Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met 
65 70 75 



Ser Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His 

Asp Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys lie Thr Asp Val 

r 100 105 

val Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ala Leu Gly Thr Thr 

Cys Tyr Ala Ser Gly Trp Gly Ser lie Glu Pro Glu Glu Phe Leu Arg 
130 I 35 A,u 

Pro Arg Ser Leu Gin Cys Val Ser Leu Hie Leu Leu Ser Asn Asp Met 

145 150 

Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala 

Gly Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly 

Pro Leu val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Pro 

195 200 * WiJ 

Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val 

210 215 
His Tyr Arg Lys Trp He Lys Tyr Thr He Ala Ala Asn Pro 
225 230 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANPEDNES5: doable 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: ClNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .732 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GTG CCC CTC ATC CAG TCT CGG ATT GTG GGA GGC TGG GftG TGT GAG AAG 48 
Val Pro Leu lie Gin Ser Arg lie Val Gly Gly Trp Glu Cys Glu Lys 
1 5 10 15 

CAT TOC CAA CCC TOG CAG GTG GCT GTG TAC AGT CAT GGA TOG GCA CAC 96 
His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His 
20 25 30 

TGT GGG GOT GTC CTG GTG CAC CCC CAG TGG GTG CTC ACA GCT GCC CAT 144 
Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His 
35 40 45 

TGC CTA AAG AAG AAT AGC CAG GTC TGG CTG GGT CGG CAC AAC CTG TTT 192 
Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe 
50 55 60 

GAG CCT GAA GAC ACA GGC CAG AGG GTC OCT GTC AGC CAC AGC TTC CCA 240 
Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro 
65 70 75 80 

CAC COG CTC TAC AAT ATC AGC CTT CTG AAG CAT CAA AGC CTT AGA CCA 288 
His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro 
85 90 95 

GAT GAA GAC TCC AGC CAT GAC CTC ATG CTG CTC CGC CTG TCA GAG CCT 336 
Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro 
100 105 no 

GCC AAG ATC ACA GAT GTT GTG AAG GTC CTG GGC CTG CCC ACC CAG GAG 384 
Ala Lys He Thr Asp Val Val Lys Val Leu Gly leu Pro Thr Gin Glu 
115 120 125 

CCA GCA CTG GGG ACC ACC TGC TAC GCC TCA Q3C TGG GGC AGC ATC GAA 432 
Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu 
130 135 140 

CCA GAG GAG TTC TTG CGC CCC AGG AGT CTT CAG TGT GTG AGC CTC CAT 480 
Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His 
145 150 155 160 

CTC CTG TCC AAT GAC ATG TGT GCT AGA GCT TAC TCT GAG AAG GTG ACA 528 
Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr 
165 170 175 
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GAG TTC ATG TTG TGT GCT GGG CTC 1X33 ACA GGT GGT AAA GAC ACT TGT 576 
Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys 
180 185 190 

GGG GGT GAT TCT GGG GGT OCA CTT GTC TGT AAT GGT GTG CTT CAA GGT 624 
Gly Gly Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly 
195 200 205 

ATC ACA TCA TGG GGC OCT GAG CCA TGT GCC CTG OCT GAA AAG OCT GCT 672 
lie Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala 
210 215 220 

GTG TAC ACC AAG GTG GTG CAT TAC CGG AAG TGG ATC AAG TAC ACC ATC 720 
Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp lie Lys Tyr Thr lie 
225 230 235 240 

GCA GCC AAC CCC TGAGTGCCCC TGTCCCACCC CTACCTCTAG TAAA 766 
Ala Ala Asn Pro 



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Pro Leu He Gin Ser Arg He Val Gly Gly Trp Glu Cys Glu Lys 
15 10 15 

His Ser Gin Pro Trp Gin Val Ala Val Tyr Ser His Gly Trp Ala His 
20 25 30 

Cys Gly Gly Val Leu Val His Pro Gin Trp Val Leu Thr Ala Ala His 
35 40 45 

Cys Leu Lys Lys Asn Ser Gin Val Trp Leu Gly Arg His Asn Leu Phe 
50 55 60 

Glu Pro Glu Asp Thr Gly Gin Arg Val Pro Val Ser His Ser Phe Pro 
65 70 75 80 

His Pro Leu Tyr Asn Met Ser Leu Leu Lys His Gin Ser Leu Arg Pro 
65 90 95 

Asp Glu Asp Ser Ser His Asp Leu Met Leu Leu Arg Leu Ser Glu Pro 
100 105 110 

Ala Lys He Thr Asp Val Val Lys Val Leu Gly Leu Pro Thr Gin Glu 
115 120 125 

Pro Ala Leu Gly Thr Thr Cys Tyr Ala Ser Gly Trp Gly Ser He Glu 
130 135 140 

Pro Glu Glu Phe Leu Arg Pro Arg Ser Leu Gin Cys Val Ser Leu His 
145 150 155 160 
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Leu Leu Ser Asn Asp Met Cys Ala Arg Ala Tyr Ser Glu Lys Val Thr 
165 170 1"5 

Glu Phe Met Leu Cys Ala Gly Leu Trp Thr Gly Gly Lys Asp Thr Cys 
180 165 190 

Glv Glv Asp Ser Gly Gly Pro Leu Val Cys Asn Gly Val Leu Gin Gly 
IBS 200 205 

He Thr Ser Trp Gly Pro Glu Pro Cys Ala Leu Pro Glu Lys Pro Ala 
210 215 220 

Val Tyr Thr Lys Val Val His Tyr Arg Lys Trp He Lys Tyr Thr lie 
225 230 235 240 

Ala Ala Asn Pro 



(2) INFORMATION FOR SBQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STFANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TATACATATG TGGGACCT3G TTCTCTCC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AIAIGGATCC TCAGGGGITG GCTGCGATOG T 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC7IH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Trp Asp Leu Val Leu Ser He Ala Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Trp Phe Leu Val Leu Cys Leu Ala Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: ami no acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Trp Val Pro Val Val Phe Leu Thr Leu 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

lie Val Gly Gly Trp Glu Cys Glu Lys His Ser Gin Pro Trp Gin Val 
1 5 10 

Ala Val Trp Ser His Gly Trp Ala His Cys Gly Gly Val Leu Val His 
20 25 30 

Pro Gin Trp Val Leu Thr Ala Ala His Cys Leu Lys Lys Asn Ser Gin 
35 40 45 

Val Trp Leu Gly Arg His Asn Leu Phe Glu Pro Glu Asp Thr Gly Gin 
50 55 60 

Arq Val Pro Val Ser His Ser Phe Pro His Pro Leu Tyr Asn Met Ser 
65 70 75 SO 

Leu Leu Lys His Gin Ser Leu Arg Pro Asp Glu Asp Ser Ser His Asp 
85 90 95 

Leu Met Leu Leu Arg Leu Ser Glu Pro Ala Lys He Thr Asp Val Val 
100 105 11° 

Lys Val Leu Gly Leu Pro Thr Gin Glu Pro Ma Leu Gly Thr Thr Cys 
115 120 125 

Tyr Ala Ser Gly Trp Gly Ser He Glu Pro Glu Glu Phe Leu Arg Pro 
130 135 140 

Arq Ser Leu Gin Cys Val Ser Leu His Leu Leu Ser Asn Asp Met Cys 
145 150 155 160 

Ala Arq Ala Tyr Ser Glu Lys Val Thr Glu Phe Met Leu Cys Ala Gly 
165 170 175 

Leu Trp Thr Gly Gly Lys Asp Thr Cys Gly Gly Asp Ser Gly Gly Pro 

Leu Val Cys Asn Gly Val Leu Gin Gly He Thr Ser Trp Gly Pro Glu 
195 200 205 

Pro Cvs Ala Leu Pro Glu Lys Pro Ala Val Tyr Thr Lys Val Val His 
210 215 220 

Tyr Arq Lys Trp He Lys Asp Thr He Ala Ala Asn Pro 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cPNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATATQGATCC ATATGTCAGC ATGTGGGACC TGGTTCTCrC CA 42 



(2) INFORMATION FOR SEQ ID NO: IB: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEEMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 18: 

Val Pro Leu He Gin Ser Arg He Val Gly Gly Tip Glu 
15 10 
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WHAT IS CLAIMED IS: 

1 . Isolated, substantially homogeneous pre-pro hK2 polypeptide. 
5 2. Isolated, substantially homogenous pro hK2 polypeptide. 

3. Isolated, substantially homogenous mature hK2 polypeptide. 

4. An antibody that is capable of specifically binding the hK2 polypeptide 
10 of claims 1, 2 or 3 and which does not bind to hK3. 

5. The antibody of claim 4 which is a monoclonal antibody. 

6. A hybridoma cell line producing the antibody of claim 5. 

15 

7. An isolated nucleic acid molecule encoding the polypeptides of claim 
1,2 or 3. 

8. An isolated nucleic acid molecule selected from the group consisting 
20 of 

(a) cDNA comprising the nucleotide sequence of the coding region 
of the hK2 gene; 

(b) DNA capable of hybridizing under stringent conditions to a 
nucleotide sequence complementary to the nucleotide sequence 

25 of (a); and 

(c) a genetic variant of any of the DNA of (a) and (b) which 
encodes a polypeptide possessing an antigenic function of 
naturally occurring hK2 polypeptide. 



30 9. 



The nucleic acid molecule of claim 8 further comprising a promoter 
operably linked to the nucleic acid molecule. 
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10. A chimeric expression vector comprising the nucleic acid molecule of 
claim 7 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

5 11. The vector of claim 10 wherein the host cell is E*_fifili. 

12. The vector of claim 10 wherein the host cell is a mammalian cell. 

13. A host cell transformed with the vector of claim 10. 

10 

14. The host cell of claim 13 which is fL_£oii. 

15. The host cell of claim 13 which is mammalian. 

15 16. A method of using a nucleic acid molecule encoding a hK2 

polypeptide comprising expressing the nucleic acid molecule of claim 
7 in a cultured host cell stably transformed with a chimeric vector 
comprising said nucleic acid molecule operably linked to control 
sequences recognized by the host cell transformed with the vector, and 

20 recovering hK2 polypeptide from the host cell. 

17. The method of claim 16 wherein the host cell is JLcoli. 

18. The method of claim 16 wherein the host cell is mammalian. 

25 

19. The method of claim 16 wherein the nucleic acid molecule is DNA 

20. The method of claim 16 wherein the hK2 polypeptide is recovered 
from the host cell culture medium. 
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BanHl <-i j — ► PPbK3 |~* pbK2 

1 G<»TC<WaTCTOOOACCTQGTTCTC^ 

OTWK^ACACCCTCXaAC^^ 

1» H etTriUUpL«ttValI*xifl«rIl«ja«l-uSerVal01yCy«TlirOlyAlaValPr 

I — > ahK2 

£6 CCTCATCCAOTCTCaOATTOTOGOAfl^ 
19 - o^laQlaSarArgllaValOlyGlyTrpO^ 
131 CTGTOTACAOTCATGaATGGGCACACTGTGGQOO^ 
41* laValTyrSarHiaOlyTrpAlaHieCy-aiyGlyValL^ 

2£^Aa1g1aT^ 
6 3 - AlaAlaHi.Cy aLauLy B LyaX«iiSerOlaValTrpI^u0XyArgSiaA«nI^tiPhaOluProal 

a/sir^flfigCMJlGOCTCCCTOTC^ 

^G^CCGOTCTCCCAOGGACAGTC 

spThrGlyGlnAroValProValSarBiaS«rPh.ProHi«Prol^uTyrA«xiMetS»rL«uL 



84- uASI 



«fi TOAAGCXTCAAAGCCTTAGACCXOATGAAGACTCCX^^ 
106- aW»HiaGlnSerI*uia:gPrQJlapaluA*pSart^^ 
391 gxGCCTGCCAAQATCACAOATGTTGTGAAQGTCCTOGGCCTGCCCA 
128 - Gi,iProAlal.y»Il«ThrAapValValLyaValI*uQl^ 

456 GJUICACCTGCTJKOCCTCA^ 

CTGGTOOACGATCCQaAGTCCOACCCCQTCOT^^ 
149* yThrthrCyaTyrAlaflarOlyTrpOlySarll^^ 

Iagtcaca^^ 

17 1 - «uGlnCy»ValSarLeuHiaL«uLauSarX»nA«pMatCy«AlaArgAaaTyr SarOluLy sVal 

SB6 ACAGAOTTCXTGTTGTOTOCTGaOCTCTOGACXO^ 
TGTCTCAAOTACAACACACaACCCaAOACCTGTC^ 

ThrGluPhaMati^uCyaAla01yLauTrpThrGlyOXyLyaA*^ThrCy»OlyOXyAflpSa 

/M<ypcCACTTGTCTGTAATaOTOTGCTTCAAGOTXTCACXTCXTGGGGCCC 

CCCXGGTOXJWauaaU^TTACCACACGXX^ 
214- yoiyProI^uValCyaJ^nalyValLauOlnGlyllaThrSartrpOXyProOluProCyaAlal. 

716 TGCCTGAXXXflCCTGCTOTOTACACCAAGGTOaTGCATTACCOGJA 

j^CGGACTTTTCaGACGACACXTGTGGTTCCACCACGTAXTOGCCTTCACCTAGTTCATGTGGTAG 
236- •uPro01uLy«ProJaaValTyr^LysV&lValHlaTyrArgLy^TrpIleLysTyrThrIle 

781 GCAGCCAACCCCTOAGTGCCCCTGTCCCACCCCTACCTCTAOTAAACTOCAG 
COTCGOTTOGGGACTCACOGGGACAOGGTGGGGATGOAGATCATTTGACOTC 

258- AlaAlaAsnPro pstl 



193- 
651 
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1 GAATTCAT<»TTQT<WGAqGCTQQGAQTOTGAaAAaCXTTCC CAA CCC 
CTTAAOIACTAJICACCCTCCaXCCCTCACACTCTTCCTAAOOOTTOGO 
1 ► MatIl«Val01y01yTrpOluCyf01uLysHi«S«rOliiPro 

49 TQQCitQOTQCCTOT Q TXCAOTCATOQATQTO 

ACCOTCCACCOXCACATOTCAOTACCTACCCG^ 
15 - Trp01»ValAl*ValTyrSerHi«OlyTrpAl«Hi»Cy«01y<3lyVal 

97 CTQQTGCACCCCCAOTQGQTQCTCAjCAOCT<^CCATTOCCT AAAQAAg 
aACCACOTOaaOOTCACCCXCOAaTOTCaXCOOOTAACaGATTTCTTC 
3 1 ► LauValHi BProOlnTrpValLauThrAlaAlaH±»CyaL«uLy»Ly« 

14 S AATAQCCAO«TCT0aCT©aaTC0aC»CAACCTaTTTC^^ 

TTATCOOTCCAOXCCGACCCATCCOTOTTtXJACAAXCTCaaXCTTCTO 
47 * AsnSer01nValTrpL«uaiyArffHi»AanLauPba01uPro01uAap 

193 ACAOOCCAGAOGOTCCCTOTCAOCCACAOCTTCCCACACCCOCTCTAC 
TOTCCO<JTCTCCCAOOOACAOTCO<JTOTCGAAOO<3TGTaOCCGAflATO 
63 ► ThrOlyainArgValProValS«rBi«SarPhaProHi«ProL«uTyr 

241 AATATGAOCCTTCTQJUlOCATCAAAIKCTTAaACCAGATQAAQJlCTCC 
TTATACTCGaAAQACTTCQTAOTTTCOQAATCTOOTCTACTTCTOAOO 
79 ► AsnM«tSerL«uX.auI.ysBiflOlnS«rL«uArgProAsp01uA«pSar 

289 AaCCATOACCTCATOCTOCTCCCCCTCTCAaAOCCTGCCAAaATCACA 
TCOOTACTOGAOTACGACOAOTOOQACAQTCTCaQACaOTTCTAOTOT 
95- s«rBi«AspL«uM«tL«uL«uArgL«uSarOluProAlaLysXl«Thr 

337 QAT<nPTOT<aAAOOTCCT<X^CTaCCCACCCAGaAOCCAaCACTOOOO 
CTACAACACTTCCAGOACCCOGACOOOTOOOTCCTCOOTCaTGACCCC 
IX i ► AspValValLy »ValL«uOlyI»«uProTlir01n01uProAlaL«uOly 

365 ACCACCTOCTAC<3CCTCACOCT<3<MOCAOCATCaAACCAaAOflAOTTC 
TOaTOOACaATOCOGAOTCCaACCCCOTCOTAOCTTOOTCTCCTCAAO 
127 ► ThrThrCy»Tyr AlaSarQlyTrpOlySarllaQluProOluOluPha 

433 TTOCOCCCCAOOACTCTTCAOTOTOTaAaCCTCCATCTCCTGTCCAAT 
AACaCGOOOTCCTCAaAAGTCACACACTCOaAOOTAaAOaACAOOTTA 
143 » L«uArgProArgS«rL«u01nCysValSarL«uHisLauI^uS«rAan 

461 QACATOTOTQCTAQAGCTTACTCTOAOAAQGTOACA 

CTOTACACACaATCTCOAATGAOACTCTTCCACTOTCTCAAOTACAAC 
159 ► AspMatCysAlaArgAlaTyrSvrOluLysValThrOluPhaMatLau 

529 TOTOCTOOOCTCTOOACAOOTOOTAAAOACACTTaTOGOOOTOATTCT 
ACACOACCCOAOACCTOTCCACCATTTCTOTOAACACCCCCACTAAaA 
175 » cy«Ala01yliattTrpThrOlyaiyLy»A«pThrCy«OlyOlyAapSar 

577 OOOOGTCCACTTOTCTOTAATOOTaTOCTTCAAOGTATCACATCATOO 
TOCCCAOQTQAACAQ A CATTACCACACaAAqCTCCATAOTQTAO TACC 
191* 0ly0lyProI#«uValCyBA«n0lyvalLau0ln01yI l«Thr SarTrp 

625 GGCCCTGAOCCATOTOCCCTaCCTOAAAAGCCTOCTGTOTACACCAAO 
CCaOOACTCOaTACACGOCACOaACTTTTCOOACOACACATOTOOTTC 
2 07 ► oiyProQluProCyaAl ai«uPro<aiuLy«PraAlaValTyrThrLy« 

673 OTOQTOCATTACCGQAAOTOOATCAAOTACACCATCOCAGCCAACCCC 
CACCACOTAATGOCCTTCACCTAGTTCATOTOaTAOCCTCOO'TTOOOO 
223* Val ValHlsTyrArgLy aTrpI laLy ■TyrThr XlaAlaAlaAsaPro 

721 TQAQTOCCCCTQTCCCACCCCTACCTCTAQT AAA CTOCAO 
ACTCACGGGGACAGGGTGGGOA7GOAGATCATTTaACGTC 



FIG. 6 



SUBSTITUTE SHEET (RULE 26) 



WO 95/30758 



7/12 



PCT/US95/06157 



1 GTGCCCCTCATCCAGTCTCaOATTQTGaOAaCCTCGCAGTGTO^ 

CACGGOGAGTAGGTauaUXCTAACACCCTCCGAjCCC^^ 
1 ► ValProL«uIl«01nS«rArffIl«Vaiaiyt31yTrpGluCyflCluLyaHi«SerOlnPro 

61 TGaCAGOTOaCTOTOTACJUTTCATGa^TOaGC&CACTCntSGG 

ACCGTCCACCQXCACATGTCAOTACCTACCCOTOTOACACCCCCA^ 
21*- Trp01nV«lAl«ValTyrS«rHis0lyTxpAI«BisCy801yOlyVAlL«uValBisPro 

121 CAGTGGGTGCTCACAGCTGCCCATTGCCTAAAGAAGAATAGCCAGGTCTGGCTGG 

GTCXCCCACGAGTOTCaACGOOTAACQaXTTTCTTCTTATCGGTCCAaACCaACCCAaCC 
41- GlnTrpValLauThxAlaAlaBlsCyaLauLyaLyaAanSarGlnValTxpl^uGlyAx? 

181 CACAACCTGTTTGAGCCTQAAGACACAGGCCAaAGGaTCCC^^ 

GTGTTGGACAAACTCGGACTTCTOTGTCCGGTCTCCCAGGaACAGTCG^ 
61- BiaAsaL«uPhaaluProGluA8pThrGlyGlaArgValProValS«rHisS0rPh«Pro 

241 CACCCGCTCTACAATATaAOCCTTCTGJUlOCATCAAAGCCTTAGACaia^ 
STSGGCGAX^TSTTATAwaCGGAAGA^a-^GTA*^ 
81*- Hi«FroL«uTyrA*nM«t S«rI*uI*uLy8HisainS«rL«uArgFroA«pGluAfipSar 

301 AGCCATGACCTGATGCTGCTCCGCCTGTGAOAOCCTCCCAAOATCACAa^ 

TCOOTACTOGAOTACOACOAOOCOOACAOTCTCOaACOOTTCT 
101»- sarBiaAflpL«uM«tL«uIi«uArgLauS«r01uProAlaLysZlaTlLEA«pValValLya 

361 qTCCTGGqCCTOCCCACCCJUaQAGCCAQCA CT OOGGACCACCT^ 

CAQGACCCGOACaGaTOaGTCCTCOGTCGTGACCCCTGGTCOACGATOCQGAff^^ 
12 1 ► ValZ«au01yXi«uProTfarainaiuProAlaL«uOlyThzTarCysTyrAlaS«rGly7xp 

421 GQCAQCATCQAACCAOAGQA U ' f T CVfU CQCCCCAjBOAQTCTTO 

CCGTCOTAGCTTOGTCTCCTCAAGAACGCGGGOTCCTCACAAOTCACACACTC 
141- QlySarXlaGluProGluOluPhaLauArgProAraSarLauGlnCysValSarlfauBia 

481 CTCCTGTCCAATOACATGTGTGCTAaAGCTTACTCTGAGAAGGTOACAGAGTTCATGT^ 

GAOaACAGOTTACrOTACACACGATCTCaAATGAGACTCTT^ 
161- L«uLauSarA«nAapM«tCyaAlaArgAlaTyrS«rGluLysValThrGluPtaaMatL«u 

541 yQTQCTOqQCTCTQQACAgQTQOTAAA Q ACA C TTQTOQGQCTQATTCTQQ 

ACACGACCCGAGJkCCTOlCCACCATTTCTOTaAACACCCCCACTAAGACCCC^ 
181- CysAlaGlyLauTxpThrGlyGlyZiyaAspThrCysOlyGlyAapSarOlyGlyProliau 

601 QTCTQTAATQGTQT GC TTCAAQOTATCACATCATQGQQCCCTqAQCCATQTQCCCTQCCT 
CAQACATTACCACACGAAG^TCCATAGTGTAGTACCCCGGaACTCQOTACACGOGACGOA 

201- ValCyaAsnGlyValLauGlaOlyllaThrSaxTrpGlyProGluProCy aAlaLauPro 

661 GAAAAGCCTGCTGTOTACACCAAGGTGGTGCATTACCQGAAGTGGATCAJ^ 

CTTTTCOGACGACACATGTGGTTCCACCACaTAATGGCCTTCACCTAGTTCATGTGCT 
221- GluLysPraAlaValTyvThrLyaValValBisTyrArgLyaTrpIlaLysTyxThrlla 

721 OCAGCaU^CCTaAOTOCCCCTOTCCCJUKCCTACCTOTA^ 

CGTCGGTTGGGGACTCACGGGGACAGGGTGGGGATGGAGATCATTT 
241- AlaAlaAanPro 
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